LOOKING FOR CONTINUOUS LOCAL MARTINGALES 
WITH THE CROSSING TREE (WORKING PAPER) 



OWEN D. JONES AND DAVID A. ROLLS 



Abstract. We present statistical tests for the continuous martingale hypothesis. That is, 
whether an observed process is a continuous local martingale, or equivalently a continuous 
time-changed Brownian motion. Our technique is based on the concept of the crossing tree. 
Simulation experiments are used to assess the power of the tests, which is generally higher 
than recently proposed tests using the estimated quadratic variation (i.e., realised volatility). 
In particular, the crossing tree shows significantly more power with shorter datasets. We then 
show results from applying the methodology to high frequency currency exchange rate data. 
We show that in 2003, for the AUD-USD, GBP-USD, JPY-USD and EUR-USD rates, at small 
timescales (less than 15 minutes or so) the continuous martingale hypothesis is rejected, but not 
so at larger timescales. For 2003 EUR-GBP data, the hypothesis is rejected at small timescales 
and some moderate timescales, but not all. 
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1. Introduction 



Time-changed Brownian motions have been proposed as models where so-called 'volatility 
clustering' or 'intermittency' is observed, in particular in finance but notably also in turbu- 
lence and telecommunications. Models that incorporate time-changed Brownian motion (pos- 
sibly after taking logs and removing drift) include, for exam ple, stochastic volatil ity models 
( Hull and White . 19871 ); infinitely divisible ca scading motio n ( Chainais et all 20031 ) and frac- 
tal activity time geometric Brownian motion ( Hevde . 19991 ). In what follows we consider the 
question of testing whether or not a given process X can be considered to be of the form B o 9 
where B is Brownian motion and 9 is a continuous non-decreasing process, possibly dependent 
on B. 

From the theory of martingales one can obtain a hierarchy of time-changed Brownian motions. 
In what follows we use the terminology subordinator for a non-decreasing process with stationary 
independent increments, which is thus a pure-jump Levy process, and chronometer for a general 
non-decreasing process. We always take a 'time-change' to be with respect to a non-decreasing 
process, possibly dependent on the past but not on the future. Let B stand for Brownian motion 
and 9 for a chronometer, defined on the same filtration {J-{\, then: 

(1) (iMonrod d 1978ft \ We can write X = B o 9 iff X is a se mimartingale. 

(2) (|Dambisl (|l965T ). iDubins and Schward (|l965T ). iKaratzas and Shrevd f|l99lh . 

iRevuz and Yon (jl999l . Proposition V.1.5) We can write X f = B o 9 + X with 8 

continuous iff X is a continuous local martingale, in which case 9 ^= (X) (the quadratic 
variation process). 

(3) (jOcond (|l993l ^ We can write X = B o 9 with 9 continuous and independent of B 
iff X is a continuous local martingale such that for any predictable H with \H\ = 1, 



f H(s)dX(s) f ^X. 



fdd 



(4) dChernv and Shirvaevl ([2002)) We can write X J = B o 9 with 9 a subordinator in- 
dependent of B iff X is a Levy process with drift and symmetric Levy measure 
v{A) = J A q(x)dx where q(\/x) is completely monotone. 

From the results of Dambis, Dubins & Schwarz, and Revuz & Yor, testing that X = B o 9 
for 9 a continuous chronometer is equivalent to testing whether or not X is a continuous local 
martingale. Note that a time-changed Brownian motion can be continuous even though the 
chronometer is discontinuous. This is the class of continuous semimartingales that are not local 
martingales, which includes for example Brownian motion with drift; the Ornstein-Uhlenbeck 
process (the Vasicek model) and Feller's square root process (the Cox, Ingersoll and Ross model). 
When we write 'continuous time-changed Brownian motion', X = B o 9, we mean that 9 (and 
thus X) is continuous. 

There is a substantial literature about the problem of testing whether or not a discrete time 
process has stationary martingale differences, ge nerall y in the context of model verification. 
A seminal paper in this ar ea is that of Bierens (ll984T ) and a useful discussion can be found 
in Dominguez and Lobato (|200lh . It should be noted that many authors who claim to test 
if a discrete time series has stationary martingale differences actually test the less restrictive 
hypothesis that the differences are stationary and uncor related; most t ests for zero correlation 
can be traced back to those of Box and Pierce (|l970h . Durlauf (|l99ll ) or Lo and MacKinlay 
( 19881 ) . The present paper takes a different approach to the discrete time literature, in part 
because we do not include stationary increments in our null hypothesis, but more fundamentally 
because there are differences between the behaviour of continuous and discontinuous martingales 
which clearly do not arise in the discrete time case. 

In the econometrics literature the hypothesis that a discrete timeseries has stationary martin- 
gale differences is called the martingale hypothesis. By analogy, we will call the hypothesis that 
a continuous process is a continuous local martingale the continuous martingale hypothesis. We 
are careful to distinguish between the continuous martingale hypothesis and the less restrictive 
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hypotheses that a process is a local martingale (possibly discontinuous) or a semimartingale, 
continuous or otherwise (Brownian motion with a possibly discontinuous time change). 

Recently there has been a lot of interest in using high frequency financial data to estimate 
quadratic variation (realised volatility) as an estimator of 'integrated variance'. The goal is 
to view financial returns (e.g. currency, metal and stoc k index spot and future s price s) in 
'financial time' rathe r than 'calendar' time. (See both Maasoumi and McAleer (2008) and 
McAleer and Marcelol (120081) and the papers therein for a review). In the context of foreign 



exchange rates, Andersen et al.l ( 2000l ) showed daily exchange rate returns normalised by daily 
realised volatilities (formed by summing 30 minute volatilities) look approximately Gaussian. 
Sometimes formal statistical testing that data is from a tim e-cha nged Brownian motio n has 
been included (e. g., A ndersen et al., 120031 : Park & Vasudev, 120051 : Peters & de Vilder, 120061 : 



Andersen et al., 20071 ). To test the hypothesis that X = B o 6, where is a continuous 



chronometer, one can estimate 6 from the observed quadratic variation and then test whether the 
increments of X o fl -1 , look like the incre ments of Brownian motion. Park and Vasudev (2005) 
(and more recently Andersen et al. ( 20071 )) also include empirical power estimates for sim ulated 
processes, which facilitate direct comparisons with other approaches. Park & Vasudev ( 20051 ) 
is distinguished as the only systematic study of the power of the quadratic variation app roach . 
Non-quadratic variation tests have also been proposed using moments by Ane & Ge man (|2000| ) 
and using a combination of excursions from zero and local time at zero by Guasoni (200J). 

This quadratic variation/realised volatility approach faces several challenges. First, a good 
estimate of the quadratic variation is required. It has been shown that market microstructure 
noise can lead the sample quadratic variati on to be a biased estimator, w ith bias that grows lin- 
early as the sampling frequency increases ( McAleer and Marcelol . 20081 ). Second, for statistical 
testing, one must make an arbitrary choice for the increment length in X o d . As we show, 
this choice can affect the results of the test. 

Our test of the co ntinuous martingale hy pothesis is based on the recently introduced concept 
of the crossing tree (Jones and Shcn, 2004) and a characterisation of continuous local martin- 
gales in terms of it (jJones and Rolls! . 12008 ) . It is closely tied to the idea of first passage times 
and avoids the need to estimate the quadratic variation, and the problems inherent in that 
exercise. Using simulation it shows discriminatory power as good, or better, than the quadratic 
variation method for a range or processes. Unlike alternatives, our approach has the advantage 
of testing a range of timescales. Thus, our method picks out the scales at which the continuous 
martingale hypothesis is plausible. We also report results from testing foreign exchange rate tick 
data from 2003 for five rates (AUD-USD, GBP-USD, JPY-USD, EUR-USD and EUR-GBP). 
We show that for timescales tested on the order of about fifteen minutes or less the continuous 
martingale hypothesis is rejected, while it is not rejected at longer timescales. 

In Section [2] we describe our implementation of the quadratic variation approach, which 
provides a basis of comparison for our crossing tree approach. Further, our implementation 
sidesteps m aking an arbitrary choice of increment length. Notably, the choice used by Park 
& Vasudev ([2005 ) seems to cause the power of th e ir tes t to suffer. In Section [3] we describe 
the crossing tree and restate from Jones and Rolls ( 20081 ) the characterisation of a continuous 
local martingale. We use this characterisation in combination with a suite of tests to test 
the continuous martingale hypothesis. In Section H] we show results from a simulation study 
which shows our crossing tree approach is generally more powerful than the quadratic variation 
approach. We also show the choice of parameter inherent to the quadratic variation-based test 
can make a large difference to the power of the test. Finally in Section Owe give the results 
from applying the crossing tree test to various foreign exchange rates. We give conclusions in 
Section [6j Matlab code for creating the crossing tree for an observed process is available at 
www . ms . unimelb . edu . au/~odj . 
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2. Quadratic Variation-based Test 



By Dambis ( 19651 ) and Dubins and Schwarz ( 19651 ) (see also Karatzas and Shreve ( 1991 



Theorem 4.6)) we can write X — Xq ^= B o with 6 continuous iff X is a continuous local 
martingale, in which case 9 = (X) (the quadratic variation process). Thus, with the inverse 

T t = mf{s > : (X) s > t}, 
one tests if X is a continuous local martingale by testing if 
(1) Y t = X Tt 

is standard Brownian motion. Peters and D e Vild er ( 20061 ) discuss the technique and apply 



it to S & P 500 returns. Park and Vasudev ( 20051 ) go further, providing a number of results 



for simulated data, futures prices and currency spot prices. In this section we discuss our 
implementation of the quadratic variation test. There are two parts to the test, estimation of 
the quadratic variation, and testing for Brownian motion, and both will be dis cussed . 

For ease of comparison our notation closely follows that of Park and Vasudev ( 20051 ). Suppose 



{Xi, . . . , X n } are n consecutive values from a time interval (0, nS] with time interval 5 between 
observations. The quadratic variation is estimated as 

( x )t = { XkS ~ x (k-i)s) > t>25. 

k>2:kS<t 

Define its generalised inverse as if = inf{s > : (X) s s > t} and introduce the analogue of (pQ) 
with Y t s = X T s . For intervals of length A form the sequence 

(2) z\ = JL (r (i+1)A _ YjA ) , j = 1, 2, . . . N - 1 



where N is such that = n5. Under the null hypothesis that the observations are consecutive 
values from a continuous local martingale, {Zj} is approximately an i.i.d sequence of iV(0, 1) 
values. What remains is to choose a value for A and then test that {Zj} is an i.i.d sequence 
with a common N(0, 1) distribution. Note that A selection is not an issue for estimating the 
quadratic variation, but rather for using it to normalise and recover Brownian motion. 

2.1. A Selection. There is freedom in the choice of A. Clearly, smaller A allows larger iV and 
so more values Zj to test. However, small A introduces interpolation error as A — > 0. Since the 
quadratic variation is estimated with some granularity through 5, under the null hypothesis 

Y (j+1)A ~ Y jA B (j+i)A ~ B jA + 0(VS) 



A 



and small A inflates the error term. Park and Vasudev (|2005l ) provide a lower bound for 



A as a function of N and 6 but it is of limited use since it depends on unknown constants. 
Effectively they search for A so that the empirical distribution of {Zj — Zf^} with Z^ N = 

E^Ji 1 Zj f{N — 1) is closest to the N(0, 1) distribution. We found this unreasonable in practice 
because it significantly reduces the power of the test technique. Rejecting the null hypothesis 
when the sequence {Zj, j = 1, 2, . . . N— 1} is not sufficiently like N(0, 1) is at odds with choosing 
A to make the sequence most like a Gaussian sequence. 

Peters and de Vilder (|2006h use a slightly different technique for A for testing their S & P 



500 data. Effectively they find successive times ti at which to evaluate X, where the increments 
(X)f. — (X)f. of the quadratic variation are greater than some arbitrarily chosen lower bound, 
but are minimal. 

Our approach is to find the mean quadratic variation increment 

g _ ( X ) S nS ~ ( X )ls 

n-2 

5 



and choose A by A = cS for some constant c. Instead of trying to decide on the 'right' value for 
c, we show test results for a range of values and use the ones most favourable to the quadratic 
variation approach (e.g. use the highest rejection rates when the null hypothesis is not satisfied.) 
Simulation results (shown below) illustrate that a range of values for c is generally acceptable. 
Since our goal is comparison with the crossing tree method, using a range of values is most 
forgiving to the quadratic variation-based method, but is a high standard for another technique 
to meet. We leave unanswered the question of how to select one value of A to perform the test 
but feel this is a drawback to the quadratic variation test. 

2.2. Statistical Tests. Testing if {Zf\ is an i.i.d sequence with a common iV(0, 1) distribution 
involves two questions. Are the values from an independent seque nce? Do the values have a 
common N(0, 1) distribution? Following Park and Vasudev ( 20051 ) we ignore the question of 



independence and test the distribution using the two-sided Kolmogorov-Smirnov (KS) statistic, 
the Cramer-von Mises (CVM) statistic and a standardised mean (SM) statistic 



s_ Y NA -Y A 



= V - 

y/N=T ^ 1 VA(JV-l) 

which is N(0, 1) under the null hypothesis. Park and Vasudev provide convergence results that 
show when these tests are applied to the {Zj } data, distributions of the test statistics converge 
to N(Q, 1) data as 5 — > 0, N — ► oo. Results from applying these tests are reported in Section [4] 
for simulated data and Section for tick-by-tick foreign exchange data. 

Peters and de Vilder (J200y) address the issue of testing distributions using the Kolmogorov- 
Smirnov and two other tests. Unlike Park and Vasudev, they also test for independence in the 
sequence. We haven't tried the additional tests they use, in part because they consider only one 
timeseries of S & P 500 data and our goal is also a more systematic comparison with other's 
results for simulated data. And while the Kolmogorov-Smirnov statistic is known to be sensitive 
to differences in the mean, the Cramer-von Mises test has been the more powerful test in our 
simulations and we think it provides a good basis for comparison. 

2.3. Type 1 Error. Under the null hypothesis, using tests with a significance level of 5% 
we expect about 5% of simulated sample paths to be rejected. To confirm the performance 
of our implementation we simulated a number of sample paths and counted the number of 
paths rejected using each of the three statistical tests. Figure [T] (left) shows the results when 
1,000,000 independent paths of Brownian motion were simulated. Each path is simulated as 
Xk = J2i=iZi, k = 1, . . . , 1250 and Xq = 0, where {Z\, . . . , ^1250} is an i.i.d. sequence of 
TV (0,1/250) values. This corresponds to simulating Brownian motion on [0,5] at the times kd, 
5 = 1/250. For a sense of the confidence intervals, using the Binomial(M,0.05) distribution, a 
95% confidence interval would use ±0.04% for M = 10 6 paths and ±0.4% for M = 10 4 paths. 
Figure [1] (right) shows similar results for paths of length n = 5000. The standardized mean 
statistic appears extremely good here, rejecting almost exactly 5%. The two other tests appear 
comparable. Note that values of c very close to zero seem to give slightly higher rejection rates 
for Kolmogorov-Smirnov and Cramer-von Mises tests. This is noteworthy because for virtually 
all other processes considered, the SM test will have little or no power and these other two tests 
will be the main ones to consider. Thus, results from those tests in which c is small must be 
regarded with some caution. For c = 20, {Zj } has length 58, on average across all the sample 
paths. So the statistical tests are using that many values. For c = 140, the tests are using 7 
values on average. Ordinarily one would expect smaller error when testing more data, but small 
c is equivalent to small A (in relation to 5) in forming the increments ([2]) and granularity in the 
sample quadratic variation becomes relevant. 

Figure [2] shows results corresponding to an exponential martingale Xt = exp(i?j — 1/2), where 
Bt is standard Brownian motion, simulated on [0, 1250] at equally spaced times with interval 
length 5 = 1/250. Here, 10,000 independent paths were used. At the left, c = 20 corresponds to 
testing 61 values on average, while c = 400 at the right corresponds to testing two values. The 
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Figure 1. Quadratic variation empirical type 1 error rates from 1,000,000 sim- 
ulated sample paths of Brownian motion with length 1250 (left) or 5000 (right) 
using tests at level 5%. Here 95% confidence intervals are of the order of ±0.04%. 
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Figure 2. Quadratic variation empirical type 1 error rates from 10,000 simu- 
lated sample paths of exponential Brownian motion with length 1250 using tests 
at level 5%. Here 95% confidence intervals are of the order of ±0.4%. 

repeating peaks arise from coarseness in the number of values {Zj} tested. The peaks at c = 
200, 250, and 310 correspond to the largest c for which {Zj } has length 5, 4 and 3, averaged 
across all sample paths, respectively. (Aside: there is very little variability in the number tested 
across the paths for fixed c.) The cumulative sample quadratic variation typically shows a steep 
upward portion for small t and a flatter portion for larger t, connected by a 'knee'. This has the 
effect that larger A makes NA larger for the same N. The presence of the knee magnifies the 
change in for different A to such an extent that the sawtooth behavior disappears if the 
same tests are applied to {Zj, j = 1, N — 2}, thus dropping Z S N _ X . Since a 5% rejection rate is 
expected under the null hypothesis, these results further emphasize that while values of c too 
small should be avoided, values of c too large should also be avoided because of a lack of data. 
Values of c between 20 and 140 seem acceptable for length 1250. (For length 5000, values of c 
between 20 and 400 seems acceptable.) Within this range, the rejection rates are between 3.5% 
and 5% for all three tests. Park and Vasudev report rejection rates of 4% (KS), 5% (CVM) and 
4.4% (SM) so the results are comparable. 
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3. Crossing Tree-based Test 



3.1. Characterizations of BM an d CL M using the Crossing Tree. The crossing tree 
was introduced by Jones and Shen ( 20041 ) in the context of self-similar processes. In this 
section we describe the crossing tree then show that it can be used to give a characterization 
of Brownian motion time-changed using a continuous chronometer. Using this characterization 
we give various tests for the continuous martingale hypothesis. 

Fix 5 > 0. Our definitions depend inherently on 5, but as it remains fixed throughout we will 
not include it in our notation. Let X be a continuous process, X(0) = 0, then for all I £ Z we 
define crossing times (more precisely first passage times) by putting Tq := and 

T) := w£{t > T\_ x : \X(t) - X{T]_ X )\ = 2 l 5}. 

By a level I crossing (equivalently size 82 l crossing) of the process X we mean a section of 
the sample path between two successive crossing times Tj_ x and Tj plus the starting time and 
position of the crossing, T^_ x and X(Tj_ x ). Let Cj be the j-th crossing of size 52 l . There is a 
natural tree structure to the crossings, as each crossing of size 82 l can be decomposed into a 
sequence of 'sub crossings' of size 52 l ~ l . We identify vertices of the tree with crossings and link 
each level I crossing with its level I — 1 subcrossings. This is illustrated in Figure O Define the 
crossing length Wj. := T k — T k _ x ] orientation a k := sgn(X(T k+1 ) — X(T k )); and the number of 
subcrossings Z k . 

Subcrossing orientations come in pairs, either H — , — h, ++ or , corresponding respectively 

to excursions up and down and direct crossings up and down. The subcrossings of a crossing can 
be broken down into some variable number of excursions, followed by a single direct crossing, 
where the orientation of the direct crossing is the same as the orientation of the crossing. Let 
if the j-th level I excursion is up (H — ) and V\ = 1 if it is down ( — h). 



yf 

3 



sample path and crossings (levels 3 to 4) 



crossing tree (points give start of crossing) 





Figure 3. The crossing tree associated with a continuous sample path. Here 
5 = 1. In the left frame, for / = 3 and 4, we have joined the points (Tj,X(Tj)); 
we see that the single level 4 crossing can be decomposed into a sequence of four 
level 3 crossings. In the right frame we have plotted the points (Tj,52 l ) for all 
l,j > 0, then, identifying crossing Cj with its starting time Tj_ l5 we joined each 
point to the points corresponding to its subcrossings. 



Theorem 1. Brownian motion is the unique continuous process B for which: 
BM0 B(0) = 0, Var[B{\)] = 1; 

BM1 For each I the W k , k = 1,2,..., are i.i.d. with Laplace transform Eexp(— \W k ) = 
l/cosh(2- /+1 /2Ai/2). 

BM2 The Z\ are i.i.d. for all I and k, with F(Z l k = 2i) = 2~\ i = 1,2, . . .; 
BM3 The Vj are i.i.d. for all I and j, with ¥(Vj = 0) = F(Vj = 1) = 1/2. 
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Proof. This characterisation of Brownian motion, in terms of its crossings, is based on ideas 
used in the construction of Brownian motion on a nested f ractal . To see thes e idea s in their 
original context, the reader is referred to Barlow & Perkins (1988) and Barlow (|l993l ). 

Given a Brownian motion B, it follows from the strong Markov property that the Zl are all 
independent. Moreover since Brownian motion is statistically self-similar, they are identically 
distributed. The distribution of Z l k is just that of the time taken for a simple random walk on 
Z to hit ±2, starting at 0, whence BM2 follows. 

From the strong Markov property we also have that for each I, the Wl are i.i.d., and self- 
similarity shows that W\, = 4'Wj\ The distr ibution of W® can be found using martingale 
techniques (see for example Burq & Jones 20081 ). which gives BM1. 



To see that BM3 holds, consider an up-crossing: the orientations of its subcrossings are the 
same as the steps taken by a simple random walk on Z, starting at and conditioned to hit 2 
before —2. 

Condition BMO just specifies the scaling of the process. 

Now suppose that we are given a continuous process B satisfying conditions BM0-BM3. Put 
X l {k) = B(T l k ), and for I < m let N l > m be the first time X 1 hits X m (l) (so iV M+1 = Z[ +1 ). 
Conditions BM2 and BM3 specify the distribution of {X l (0), X l (N l > l+1 ) \ X l+1 (0) , X l+1 (l)} , 
and thus by induction t he distribution of |X ' (0), . , X l (N l ' m ) | X m (0) , X m (I)} , for any I < m. 
(In the terminology of Barlow and Perkins! ( 19881 ). the random walks X 1 , I G Z, are nested.) 



It is straightforward to show that we get precisely the same laws for the subcrossing num- 
bers and orientations if we take the simple symm etric random walk on 52 l Z, started at 
and run it until it hits ±52 m . (See I J ones! ( 19961 ) for the explicit calculations.) That is, 



{X l (0), . . .,X l (N l ' m ) | X m (0),X m (l)} is a simple symmetric random walk on 52 l Z, started at 
and conditioned to hit X m (l) before -X m (l). 

From BM2 and BM3 we have that for any m E Z and n G Z + , 

P(X m (l) = 52 m \X m (0) = 0) = ^F{X m+1 (l) = 52 m \X m+1 (0) = 0) + \ 

1 n 1 1 

= -F(X m+n (l) = 52 m+n \X m+n (0) = 0) + J2 ¥ +i = a" 

i=l 

Thus, removing the conditioning on X m (l), X l (k) is indistinguishable from a simple symmetric 
random walk for Jfc = 0, ... , N l ' m . But N l > m > 2 m ~ l , so sending m->oowe see that X 1 is just 
a simple random walk on 52 Z. 

It is well kn own that as / — > — do, 4 l X l converges a.s. to a Brownian motion, X say. Moreover 
Hamblvl (jl995T ) shows that, up to a constant scaling, the crossing times of X have the distribution 



given by BM1, so X is just a scaling of B. Thus from BMO, B is a standard Brownian 
motion. □ 

Remark 2. Our definition of the crossing tree considers the process when it hits new points on 
the lattice Xq + 52 lr L, for all levels I G Z. We can just as easily consider lattices a + 52 l Z, by the 
simple modification of putting Tq = inf{t > : X(t) G a+52 lr L}. Similarly, our characterisation 
of Brownian motion can be generalised to allow for lattices centred at an arbitrary point a. The 
•pro of is essentially the same, but does require more care with the nested random walks X 1 , as 
per \Barlow and Perkim (1988) Theorem 2.14. 



Clearly the Vj and Z{ are invariant under a continuous time-change, so a continuous local 
martingale must satisfy BM2 and BM3. We show below that these properties characterise a 
continuous local martingale, up to a shift at time 0. To do so we need to know a little more 
about the crossing times of Brownian motion, which the following lemma provides. 

Lemma 3. Let {P(n)}^ =0 be a supercritical Galton-Watson branching process, with P(0) = 1. 
That is, P{n) is the population size at generation n and fi := EP(1) > 1. Suppose that a 2 := 
VarP(l) < oo and let be the normed limit of the tree branching from the k-th individual in 
generation n. (SoM,LZ = l, VarL^ = a 2 / (fj, 2 — fj) and = \\m n ^ 00 ^ n P{n) a.s. and in mean 
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square.) Then 



lim max a n Ul = a.s. 

n->ooo<k<P(n) 



Proof. Our proof uses extreme order statistics for Galton- Watson trees (Pakes Il998l ). From 
Chebychev's inequality we have F(L k > x) < x~ 2 E(L' k ) 2 = x~ 2 (l + c 2 /(/i 2 — fi)). Thus we can 
find i.i.d. random variables M k such that for all x, P(-M > x) < P(M^ > x) ~ cqx~ 2 . The law 
of M k is in the domain of attraction of the extremal distribution exp(— x~ 2 ), that is, writing 
M for a generic M k , 

lim zF(M > Xy/c^z) = x~ 2 . 



Thus from Pakes ( 19981 ) Theorem 4.1 we have, for H\{x) = Eexp(— x 2 P(1)), 



lim P [ max a~ n / 2 M^ < xJc~ 

n^oo \0<k<P(n) j 



HAx). 



Finally, max <fc<p( n ) /u n L k is a strictly decreasing function of n (since each L k can be written 
as a sum of terms // _1 L™ +1 for various j), so for any e > 



lim max u 

n^oo Q<k<P(n) 



- n Vl. < e 



lim F max u 

n^oo \0<k<P(n) 



' n Ll < e 



> lim F f max n~ n Mj} < e ) 

n~*oo \0<k<P(n) J 

= lim F ( max a~ n / 2 M^ < a n l 2 e 

n->oo \0<k<P(n) " 

= lim H\(x) = 1. 



Sending e — > gives the result. 



□ 



Corollary 4. A continuous process X is a continuous time-change of Brownian motion, equiv- 
alently a continuous local martingale, if and only if 

CLMO Z(0) = 0; 

CLM1 The Z{ are i.i.d. for all I and k, with F(Z l k = 2i) = 2~\ % = 1,2, . . .; 
CLM2 The V l k are i.i.d. for all I and k, with F(V k l = 0) = ¥(V k l = 1) = 1/2. 



Proof. The 'if part is clear, since Z l k and Vj are unaffected by a continuous time-change. 

We show the 'only if part as follows. Let W k be the crossing times of X. Properties CLM0- 
CLM2 are enough for us to construct a Brownian motion B, with subcrossing family sizes Z- 



whose crossing time s Wj are obtained as normed limits of the Galton- Watson processes defined 
by the Z l k fHamblv fl~995h . By construction we have B(f l k ) = X{T l k ), where T k = J2j<k Wj and 
T{ = Ej<k Wj. Defining 9(T l k ) = f{ we get, for t = T l k , B(9(t)) = X(t). 

Since X is continuous, for any t we can find a sequence {k(t,l)} 
* e \ T k{t l)i T k{t We use tnis to exten d to [0, oo): for t G [0, oo) put 9(t) ■■ 

The result now follows provided that 9 is continuous, that is provided lim^„ 
all t. From Lemma [3] we see that this holds for almost all sample paths of B. 

Finally, by construction we have that B(t) and 9(t) are J~t measurable, where {J~t} is the 
filtration generated by X. □ 



fZ-oo such that for all I 
linn^oo fl 



-OO WlfoQ 



k(t,l)- 
for 



Moreover, by Remark [2] this characterisation generalises to X(0) = a for constant a G R. Also 
notice we do not need to assume the time change is independent of the Brownian motion. 
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3.2. Small scale diffusive behaviour. We show that for regular diffusions, at sufficiently 
small scales, the diffusion dominates the drift and these processes look like (continuous time- 
changed) Brownian motions. A key idea for these results is the scale function (see Appendix 
IA"|) which allows computing the probability ps(x) that a size 5 crossing starting at x is an 
up-crossing. 

Lemma 5. For a continuous strong Markov process X, ifps(x) is a constant ^ or 1 then the 
{V^ } are i.i.d. Bernoulli (1/2). 

Proof. Excursions are equiprobable iff for all x £ b7L 

X(T fe ° +1 ) = x + 6, X(T° +2 ) = x | X(T fc °) = x, X(T fc ° +2 ) =x)=\. 
That is, 

Ps(x)(l -ps(x + 6)) 1 

pa 0*0(1 -ps(x + 5)) + (1 -p s (x))ps(x -5) 2' 
which clearly holds if ps(x) is constant and non-degenerate. If ps(x) does not depend on x, then 
from the strong Markov property the crossing orientations {a^}fc and thus the excursions must 
be independent. □ 

An immediate consequence of this result is that CLM2 holds for any continuous time-change 
of Brownian motion with drift. The next lemma shows that for a large class of diffusions, CLM1 
and CLM2 hold approximately at small scales. That is, at small scales, these diffusions looks 
like continuous local martingales. 

Lemma 6. Suppose X is a continuous regular diffusion on some interval, with differentiable 
scale function s and non-absorbing boundaries, then as 5 — > 0, {Z\}^\ and {V^}^Xi converge 
infinite dimensional distribution to i.i.d. sequences, with distributions 2- Geometrical/ 2) and 
Bernoulli{l/2) respectively. 

Proof. For any x in the interior of the range of X we have from ([7]) that 

£5*M = M) = 1/2 ' 

If ps(x) does not depend on x, then from the strong Markov property the crossing orientations 
{a^,}fc must be independent. Thus we see that as S — ► 0, {a®} converges in finite dimensional 
distribution to an i.i.d. Bernoulli(l/2) sequence. That is, {X(Tf})}k converges to a simple 
symmetric random walk. The result now follows as per the proof of Theorem [TJ □ 

3.3. Test statistics. We test the continuous martingale hypothesis by testing properties CLM1 
and CLM2 of Corollary [4j Formally, property CLM1 characterises the sample-path variation 
while CLM2 characterises the symmetry of the sample paths. To test CLM1 we need to test the 
distribution of the ZL and their independence. To test CLM2 we will test that for each level, 
amongst the up-down and down-up pairs of subcrossings, each has probability 1/2 and they are 
independent. The last pair for any subcrossing is up-up or down-down, and is not included in 
the testing. 

We suppose that we have points {(Tj, X{), i = 0, . . . , M}, with X, = X(Ti) as observations of 
the continuous process X(-) from time interval [0, T\. With this we can construct a continuous 
process on [0, T] by connecting consecutive points by linear interpolation to yield 

Y(0) = X(0), Y(t) = X * ~ ^ fc ~ 1 (t - r fc _ x ) + Xfc-i, t € (r fc -i, T k \. 
J-k — J-k-l 

Then for a given 5 > and e>o £ R we can construct at level (i.e., size 5) crossing times T® G 
[0, T] and corresponding crossing types a\. In practise we use 5 = mediam ;= i ) ... ) M{|^fc — Xk-i\} 
for the level crossing size, but this is arbitrary and not crucial since we test across multiple 
levels 2 5. Since we also simulate crossing data it is helpful to use a definition that recovers 
exactly the crossing size in the data when it has that form. 

li 



For the choice of 6q, although 5q = seems natural, there are advantages to other choices. 
Under the null hypothesis, if So is independent of {X(T k ), k > 0} the distribution and i.i.d. 
properties of {V k } and {Z l k } remain unchanged. For example, one could take <5o = X(0), since 
under the null hypothesis X{t) — X(0) is a co ntinuous local martingale an d X(0) is independent 
of the Brownian motion {X(/9 _1 (i)), t > 0} (jKaratzas and Shrevel Il99ll . p. 178). 

To improve the power of our test against stationary alternatives (e.g., Ornstein- 
Uhlenbeck processes) we use something slightly more complicated. We imagine our data 
is {(Tk,Xk), k = — 30, ...,M}, and use {(Tk,Xk), k = — 30, 1} to form the first 30 
crossings {(T%, Y(T k ), k = — 30...,— 1}. We use those to form the 'latticised mean' 6q = 
EfcI- 30 ^( T °)/30, and then use {(T k ,X k ), k = 0, . . . , M} to form Y{t), {V k }, and We 
estimate the mean using crossings, not the data directly, to ensure 5q is independent of the 
tree, since dependence could enter though #(•). All of the simulation results reported below 
were tested three ways, using 5q as 0, Xq (the first data point), or the 'latticised mean' just 
described. The latticised mean shows discriminatory power as good or better than the other 
two possibilities. Substantially larger power was seen for mean reverting alternatives where the 
mean is non-zero. Since the type 1 error is unaffected we feel using this 'latticised mean' is the 
best for building the crossing tree. We leave for future work determining how much data to 
dedicate to estimating the mean. 

From these definitions we can derive the crossing times Tl, excursion types Vl and subcrossing 
family sizes Zl for level 1 up to some maximally observed level m. Throughout, let N(l) 
and Ny(l) be the number of observed crossings and excursions, respectively, at level I for 
I = 0,1, ... ,m ■ Note that we do not observe Zl for I = 0. Also note that since the tests on 
{Z l k } and {Vl} are invariant to changes in the time co-ordinate of the data, it is enough to have 
timeseries of the form {(i, Xi), i = 0, . . . , M}. 

With the crossing tree for the data thus defined, we can state more precisely the basic question 
our test addresses. When viewed crossing the lattice 5q + 2 l 5Z, does a linear interpolation of 
the data process look like a continuous time-changed Brownian motion? 

For 5 very small, the linear interpolation will provide numerous crossings between data points 
and Z l k = 2 will be frequently observed at small levels I. The test results will be an artifact 
of the small 5. In these cases the null hypothesis will be rejected simply as a consequence of 
small 5. Practically, this is a familiar issue when using discrete data where continuous paths 
are assumed. At resolutions that are too fine, the data doesn't "look" continuous, but this isn't 
the main point of the test which concerns whether the data is from a continuous time-changed 
Brownian motion. Practically we just test at larger 5 too. 

It is common to have observations of a process at regular (e.g., hourly, daily) points in time. 
For building the crossing tree, this will generally mean some crossings are unseen. This effect 
is mitigated if the time scale of the observations is small compared to the time scale of the 
crossings. Alternatively, if we have observations of a process when it changes (e.g., financial 
tick data) then we don't lose crossings and are just testing continuity as part of the test. 

All of our tests using subcrossings are be based on {Z[, Z\, . . . , Z l N ,n} and {V/, V\, . . . , V^m} 
for each fixed I G {1,2, . . . , to}. This allows us to examine the process X at different scales. 
This is very important from a practical perspective for two reasons. Firstly, observed processes 
invariably have a limiting resolution below which they are discrete rather than continuous, so 
continuity is a modelling assumption that can only apply above a certain scale. Secondly, one 
often sees different mechanisms at work at different scales and we should not expect a single 
model, such as continuous time-changed Brownian motion, to be a good approximation at all 
scales. For finance in particular, it is generally accepted that markets exhibit microstructure at 
small scales that is not seen at larger scales. 

To relate the physical scale of crossings to a temporal scale we use average crossing times. An 
observation of Z k is based on level I — 1 crossings (of size 52 l ~ l ). Accordingly we use EW^ 1 for 
the temporal scale corresponding to the physical scale 52 l ~ 1 , which we test using the Zl. We 
remark that in general, for fixed I, the W k are not stationary, so care needs to be taken when 
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estimating and interpreting E^. In particular high-frequency financial data typically exhibits 
daily an d weekly periodicity in vo latility, which is expressed as daily and weekly periodicity of 
the W l k ( Arroum and Jones . l200d ). 



3.4. Distribution Tests. We consider several tests of the null hypothesis that, for each 
level I, the subcrossing sizes {Z\, Z%, . . . , Z N ,^} form a random sample of values from a 
2-Geometrici(l/2) distribution. 

3.4.1. Twos Test. Under the null hypothesis, {Z[, Z\, . . . , Z^ny} is a random sample of val- 
ues from 2-Geometrici (1/2), and so '2' is expected to be the most frequent number. Let 
T be the number of twos in {Z[, Z\-, ■ ■ ■ , Z 7 ^,^}. Then, conditional on N{1) = M, T has a 
Binomial(M,l/2) distribution under the null hypothesis. This is exact even for short datasets 
where tests based on asymptotic methods may have problems. We reject the null hypothesis 
when the value of T is too close to or M. 

3.4.2. x 2 -Distribution test. We use a x 2 -^est to compare the empirical distribution of the 
{Z l k }fc=i against the distribution given in CLM1, that is 2 • Geometrici(l/2). 

For i > 1 put 0{ = #{k : Z k = 2i} and E$ = N(l)2~ l (observed and expected frequencies). 
Let Oi + = J2j>i Oj and Ei + = J2j>i Ej-, then our test statistic is 

(3) D { l,d) = Y,^-^ + {0d+ - Ed+)2 . 

Under the continuous martingale hypothesis, for fixed / and d, D(l,d) — ► Xd-l as -^(0 — ¥ °°- 
We reject the hypothesis (at level I) if D(l,d) is large. 

It is usually recommended that the approximate x 2 distribution is only used if the expected 
frequencies are at least 5. In our case the smallest expected frequencies are E^-i = Ej+ = 
A^(/)2 - ^ -1 ^. Since the smallest sensible value for d is 3 (2 degrees of freedom), this would 
suggest that: 1) don't apply the test for N(l) < 20, 2) use d = 3 for 20 < N(l) < 40, 3) use 
d = 4 for 40 < N(l) < 80 and d N(fi = [log 2 (N (I) / 5) + lj, N(l) > 20 in general. 

Since under the null hypothesis we know we have a 2- Geometrici(l/2) distribution, we have 
been able to improve on the rule of thumb. For N(l) < 13 we still do not apply the test. For 
each of the 26 values for N(l) in {14, 15, . . . ,39} we used Monte Carlo simulations to obtain 
empirical critical values. For each M in {14, 15, . . . , 39}, 10,000,000 datasets were used to create 
empirical critical values Cm, 0.95 corresponding to the 0.95 quantile of the observed test statistic 
with d = 3. We then reject the null hypothesis if D(l, 3) > Cjvmo.95- 

Following numerous simulations, for N(l) > 40 we have found using two extra bins, so 
+ 2, is preferable to dfjny Simulations for various values in 40 < N(l) < 300 show that for 
a test at 5% significance level, the type 1 error is usually not much larger (e.g. 5.2% vs. 4.8%), 
and is far outweighed by the substantial power the test gains. 

We note here that stationarity of the Z l k can also be examined using a x 2 -test. If we split 
the sample into m parts, calculate D{l,d) over each part then sum, the resulting statistic will 
be asymptotically X^d-i)- 

3.4.3. G Test. Besides the x 2 tests described above we implemented several other tests. The 
so-called 'G-test' based on the log- likelihood ratio forms the test statistic 

°d = 2 (j2 °i HOi/Ei) + O d+ In {O d+ /E d+ )^ 

and has the same limiting x\-\ distribution as N(l) — ► oo. We used the basic rule dj^m = 
Llog 2 (iV(0/5) + lj. 

13 



3.4.4. Discrete Kolmogorov-Smirnov. For continuous data, the Kolmogorov-Smirnov goodness- 
of-fit test is thought to be more powerful than a "binned" test like the y 2 because the former 
makes use of the natural order to the data which is lost through binning (Horn, 19771 ). As such, 
a Kolmogorov-Smirnov test for discrete data should be preferable. Unlike w i th co pntinuous 
data, the distribution of the test statistic is not distribu tion free. Conover ( 19721) provides 
a way to approximate critical values for short datasets. IWood and Altavelal ( 197a ) suggest 
estimating the critical value of the test statistic using a Monte Carlo simulation to generate 
multivariate Normal vectors with calculable covariance matrix, with dimension one less than 
the data length. Our approach is more straightforward. Using standard definitions for datasets 
of length M, Fm(x) = #{Xi < x}/M and H{x) are the empirical c.d.f. and the c.d.f. under 
the null hypothesis. The two-sided test statistic is 



D M = max M 1 / 2 \H(x) - F M {x)\ 

where the maximum is over the set of jump points J. For each data length M from 2 to 1000, 
1,000,000 datasets were simulated under the null hypothesis to form as many test statistics. 
The 0.95 quantile of those provided the critical values Cm, o. 95- If ^ 1000 we reject the null 
hypothesis if D n(Jl) > C N ^ }0 _ 95 . If N(l) > 1000 we reject if D N ^ > Ci oo,o.95, using Ciooo,o.95 
as our estimate of the asymptotic critical value, which the data suggests is reasonable. 



3.4.5. KLP 1998. Finall y, a test based on the first two moments is described by 
Kvriakoussis et al. (|l998h . In particular, we use their results to test if the data is from the 
particular negative Binomial distribution that is the geometric distribution. Their test statistic 
Tnb is built around a standardized version of 

- 

C (in \ 2 ' 

and is asymptotically iV(0, 1). Thus we reject if |Tjvb| > Co.975 where C0.975 is the 0.975 quantile 
of the -/V(0, 1) distribution, which asymptotically would provide a test at 5% significance. 



3.5. Independence tests. Testing for independence in a sequence of (stationary) random 
variables is not straight-forward. In the absence of a sufficient omnibus test we have tried a 
variety of tests, and report on the use of several key ones here. 

Linear dependence can be measured using the autocorrelation, and we include such a test. 
One can consider the autocorrelation at many lags, however we restrict ourselves to lag 1 
autocorrelation because in practice a short term dependence in the increments of X, and thus 
a lag 1 dependence in {Z l k }^^, is the most plausible departure from independence. 

The most fundamental approach to testing the independence of a sequence Ai,A%,... is, for 
r = 2, 3, . . ., to estimate the joint distribution of A^, ^4fc+i, • • • , A^+r-i and compare this with the 
r-th power of the estimated marginal distribution. For continuous distributions, estimation of 
joint distributions can be done in a distribution free manner, and there are ge neral approa c hes to 
testi ng indepe n dence using estimates of joint distributions. See for example Blum et al. ( 196ll ) 
and iRobinsonl (1991). For estimating joint distributions of discrete random variables, ad-hoc 
approaches are required. 

Other existing tests are built around runs and clustering in the data. Fix I and let = 1 if 
Z l k = 2 and If, = otherwise. Let Ui and V{ be the length of the i-th run formed by consecutive 
zeros and ones, respectively. Let no and n\ be the number of zeros and ones in {Ji, . . . , Ijv(/)}- 
Under the continuous martingale hypothesis I±, . . . , Ijv( n ) are an i.i.d. Bernoulli (1/2) sequence, 
and {Ui, i = 1, . . . , iVo} and {Vi, i = 1, . . . , iVi} are two i.i.d sequences, where iVo and Ni are 
the number of runs of zeros and ones respectively. Tests of independence can be built around 
the binary sequence {ii, . . . , In(i)} and its related quantities. 
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There are a variety of other tests that could be used, but it is impossible to implement and 
report on all of them. One such test, designed to d etect non-linear dependence using the so- 



called correlation dimension, is that of lBrock et al. (1996). We feel the tests we have included 



are a fair cross-section of the known tests for independence for finite distributions. 



3.5.1. Autocorrelation test. There are many approaches to testing for linear dependence 
in a sequence of random yariables, based on t h e autocovariance. In pa rticular those of 
Durbin and Watsonl (|1950M Box and Pierce) (|l970l ). ILo and MacKinlavl (|l988l ) (variance ratio) 
and (|Durlauill99lh (spectral approach). However for simplicity we restrict ourselves to the lag 
1 autocorrelation. 

Let Z l be the average of Zf, 



N(l) 



then our test statistic is 



(4) 



h(l) 



v JV(i)- 



\Z[ +1 -A){Z[-A) 



e3(4-^) 2 



Note that we use the known mean of Z l k in the numerator but not the denominator, as this 
helps inflate when the null hypot hesis is false. Using sample means in the numerator, we 
know from Kendall, Stuart and Ord (Il983l ) that h(l) « N(-l/N(l),l/N(l)) for N(l) large. 
However, using the known means under the null hypothesis, we have the following result. 



Lemma 7. 



0V(i)Ji(O N(0, 1) as N(l) 



oo. 



Proof. For fixed I, under the null hypothesis, {Z l k } k ^ 



is an i.i.d. sequence with common 
distribution 2 • Geometrici(l/2). So, for all k we have E[Z£] = 4 and VarfZjp = 8. Now, let 
(4+1 -4X4-4) and 



_iv(0-i 
2^k=l 



We have E[U l k 



Since {U l k , k 



8y/N® - 1 

0, Vai[U l k ] = 64, Cov{Ul,U l j) = for i ^ j and VarpZ}] =. vv 
1, . . . ,N(l) — 1} is a stationary m-dependent sequence with m = 1, by Ibragimov and Linnik 

( Ibragimov and Linnikl . 1971 . Theorem 19.21), 7} iV(0, 1) as N(l) — > oo. By consistency of 
the sample variance estimator, 



wo - 1) 



Efi?(4-^) 5 



1 as N(l) — > oo, and 



iv(0/i(0 



8(N(l) ~ 1) 



1 s^N(l) (7 .i 
L l^k=l V z k 



z l y 



-Ti -=* N(0, 1) as N(l) 



oo. 



□ 



Thus ~ iV(0, 1/N(l)) for iV(Z) large and we reject the continuous martingale hypothesis 
(at level I) if |ii(0l is large. In practice we found this asymptotic result has acceptable error 
under the null hypothesis for N(l) > 100. For N(l) < 100 we used empirical critical values 
again. In particular, for M in {5, 6, . . . 100} we simulated 10,000,000 datasets of length M to 
create estimated critical values Cm, 0.025 and Cm, 0.975 using the 0.025 and 0.975 quantiles of the 
generated test statistics. For 5 < N(l) < 100 we reject the continuous martingale hypothesis 
(at level I) if < C , jv(;) i o.o25 or ^i(0 — Ctv(z),o.975- We found using both critical values more 
robust than using one and assuming |/i(/)| is symmetric. For N(l) < 5 there just isn't enough 
data to reject the null hypothesis. 
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3.5.2. Joint Distribution test. Because estimating joint distributions requires a lot of data, we 
restrict ourselves to the bivariate case. In our case the joint distribution of Zj. and Z l k+l is 
known under the null hypothesis, so we can compare the empirical joint distribution against 
the known distribution using a x 2 -test. Fix I and split the observations Z[, . . . , Z l N g\ into pairs, 

{Z\ k _ x , Z l 2k ) for k = 1, ... , [N(l)/2\ . For i,j>l put O id = #{k : (Z l 2k _ v Z l 2k ) = (2i, 2j)} and 
E{ j = [N(l)/2\ 2~^ +J ) (observed and expected frequencies), make the definitions 

Q+j = Ohji Oij+ = 2J O iy k, Oi + j + = Oh,k 

h>i k>j h>i,k>j 

and similarly for -Ei+j, Eij + and Ei + j + . Our test statistic is 

i,i=l ^ i=l 



(5) 



(Pd+j ~ E d+,j) 2 {Od+,d+ ~ E d+,d-\ 

j=l E d+,j E d+,d+ 



\2 



Under the continuous martingale hypothesis, for fixed I and d, l2(l,d) — > Xrp—i as -^(0 — * °°- 
We reject the hypothesis (at level /) if l2(l,d) is large. 

It is usually recommended that the approximate \ 2 distribution is only used if the ex- 
pected frequencies are at least 5. In our case the smallest expected frequency is Ed+,d+ = 
LA^(/)/2j2 -2 ^ -1 ) . Since the smallest sensible value for d is 3 this would suggest we need 
N(l) > 160. Following extensive simulations we have found we can use d = 3 for N(l) > 10, 
significantly smaller than 160. Figure H] shows the rejection rate and 95% confidence intervals 
from simulations of 100,000 datasets of length M between 10 and 100 corresponding to the null 
hypothesis. These show a rejection rate of at most 5.7% from a test of significance level 5%, 
which we believe is acceptable considering the benefit from applying the test to much shorter 
datasets. We do not apply the test to datsets with length below 10. 

This test could reject either from bi-variate dependence or a departure from the hypothesized 
marginal geometric distribution. Labelling this test as an independence test is very deliberate 
on our part. Notice that if the sequence of subcrossings {Z k } is randomly permuted, the same 
test could also be applied, but the cause of any departure from the null hypothesis is confined 
to a departure from the geometric distribution. In fact we have done this with our simulation 
data and found, consistently, that a substantially smaller percentage of paths is rejected after 
permuting the subcrossings. This tells us that dependence, not distribution, is the main reason 
for rejection with the method. 



3.5.3. Wald and Wolfowitz. The test of Wald and Wolfowitz (ll940T ) is based on the number of 
runs in . . . , 1^(1)} ■ Too few indicates a tendency to cluster, too many indicates a tendency 
for values to alternate. Asymptotically the test statistic is N(0, 1). It is implemented in Matlab 
as 'runstest' which allows computing a p- value using the exact distribution for all lengths instead 
of the asymptotic N(0, 1) limiting distribution. 



3.5.4. O'Brien 1976. The test of O'Brien (|l976h applies the results of Dixon (jl940h for two 



independent samples to the situation of the binary sequence, and is sensitive to multiple clus- 
tering. Without loss assume m > no, so ones are more numerous. This test uses the biased 
version of the sample variance s 2 of the sequence {Vi, . . . , V^}, conditional on no and n\. Too 
much or too little variability indicates a tendency to cluster. O'Brien suggests that for cs 2 where 
c is known the distribution is asymptotically x\> where v is a function of no and m, and a table 
of critical values can be used when no and n\ are less than or equal to 10. Unfortunately v can 
be a non-integer, which leads to some approximation error rounding v up or down. Using an 
idea alluded to in Dixon, instead of a x 2 distribution we use a Gamma distribution for the test 
statistic, which allows the shape parameter to be fractional and for less approximation. For no 
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Joint Distribution Test: Empirical Type 1 error with 95% C] (1 e5 iterations) 
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Path Length(n) 



Figure 4. Empirical type 1 error with 95% confidence interval for our joint 
distribution test for datasets of length n > 10 using 10 5 independent iterations 
for each n. 

and n\ less than or equal to 10 we use empirically generated critical values for a test at 5% 
significance. 

3.5.5. O'Brien & Dyck 1985. O'Brien and Dyck ( 19851 ) extend the sample variance-based 



method to using a weighted combination of the sample variances for the zeros and ones: 
T = cqSq + c\s\ which is approximately Xu +u 1 where the degrees of freedom can be calcu- 
lated. The weights c\ and C2 are chosen so that cqSq and c\s\, which are two asymptotically 
independent x 2 statistics on their own, have equal scale parameter. This approach again requires 
rounding up or down to create an integer degrees of freedom, or using a Gamma distribution. 
Potthoff (|l985ft raises several criticisms of the choice of the weightings, and discusses alterna- 



tives, but goes no better in establishing which combination might be better. He also suggests 
that since the contributions of Sq and si are asymptotically independent Normal, the sum is 
asymptotically Normal, with known mean and variance. We have tried four test statistics with 
related asymptotic distribution, three using the weighted sum as either a x 2 distribution with 
degrees of freedom rounded up or down, or a gamma distribution. Our fourth test statistic 
is Potthoff 's sum which, when standardized, is asymptotically iV(0, 1) and does not involve 
any weights. We found the Normal approximation lacked power, and rounding the degrees of 
freedom down consistently rejected too much. As such, we will only report on results using the 
gamma distribution approximation of the weighted sum. 



3.5.6. Larsen. The test of Larsen et al. (1973) is a test for unimodal clustering. For n\ successes 



let the locations of those successes be 1 < R\ < R2 < • • • < R ni < N(n). Let 

ni 

K 1 = Y j \Rt - R\ 



where R is the sample median of {R±, . . . ,R ni }. The test statistic is the standardized version 
of K\ which is asymptotically Normal. For datasets of length M in {3, 4, . . . 80} we used 
empirical critical values again. In particular, we simulated 10,000,000 datasets of length M to 
create estimated critical values Cm, 0.025 and Cm, 0.975 using the 0.025 and 0.975 quantiles of the 
generated test statistics. For M > 80 we use the asymptotic N(0, 1) approximation to the test 
statistic. 

3.6. Empirical Type 1 Error. First, we demonstrate the performance of the crossing tree 
technique when the null hypothesis is satisfied. The first 1250 crossings of standard Brownian 
motion were simulated using the technique described in Appendix [Al To obtain 1250 crossings 
corresponding to standard Brownian motion on [0,5], we used 5 = l/(5\/l0) as the basic 
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crossing size. Figure [5] (left) shows the results for the distribution tests applied to the sequence 
of subcrossing lengths {Z l k , k = 1, . . . ,N(l)}. At the smallest levels all five tests reject close 
to 5%, with the Twos tests rejecting the fewest paths, at about 4.5% at level 1. The x 2 test 
with two extra degrees of freedom and the G test reject close the level of the test. These are 
both "binned" tests and similar performance is reasonable. As the levels, I, increase the length 
of subcrossing data {Zl, k = 1, . . . , N(l)} decreases. For Brownian motion, each increase in 
level decreases the length N(l) by a factor of four on average: at level one N(l) is about 305 on 
average, at level 4 it is about 5 on average. Thus the percentage of 10,000 paths rejected falls, 
since N(l) becomes too small to allow a test. The smallest length we allow varies with the test. 
Table [TJ shows the data from the graph in boldface, and also the percentage of paths rejected 
out of the number tested, and the number tested. At level 4, all of the paths were shorter 
than 14, which is our cutoff to apply the \ 2 test with two extra degrees of freedom, so the test 
was never applied. On the other hand, the Twos test is an exact Binomial test which we can 
apply for all values of N(l). Our implementation of the discrete Kolmogorov-Smirnov test uses 
empirically generated critical values, so it can also be applied for all lengths, and performs well 
with shorter datasets. 

Figure [5] (right) shows the results from applying tests of independence to the subcrossing data 
{Z l k , k = 1, ... ,N(l)}. The joint distribution-based test shows rejection at about 5% across 
levels one to three, with little reduction as level increases, until a large drop at level 4. The 
autocorrelation test starts below 5% but increases to a value closer to the 5% value expected. 
The four "classical" tests of independence we have chosen to include appear qualitatively similar, 
although the Runs test rejects fewer paths than the other three. The Matlab implemenation 
of the Runs test is exact and can be applied to datasets of all lengths, so its performance is 
disappointing here. 

Figure [5] (right) also shows results from applying our four "classical" tests of independence 

to the binary sequences {V k 1 }^^ representing the the excursions at level I. (These tests are 
indicated with the suffix 'UD' for Up-Down in the legend.) The results mirror those from the 
same tests applied to the subcrossing data, with the Runs test rejecting a noticeably smaller 
percentage of paths than the other three. From the totality of the tests, saying 5% rejection is 
reasonable (perhaps somewhere between 4.7% and 5.7% on closer i nspec tion) . This compares 
favourably with the quadratic variation method. Park and Vasudev (2005) found 6.4% rejection 
using a two-sided KS test, 5.6 % rejection using a CVM test, and 4.4% rejection using their SM 
test. Our implemention shows 5%-5.5% rejection using the same three tests, as was shown in 
Figured] (left). 



4. Power estimates 

To measure the performance of various distribution and independence tests, we applied them 
to some well-known processes, including diffusion processes defined by stochastic differential 
equations of the form: 

(6) dX t = A(X t )dt + B{X t )dW t . 

In particular we consider 

(1) Brownian motion with drift, A[Xt) = a, B(Xt) = 1; 

(2) Ornstein-Uhlenbeck process, A(X t ) = —aX t , B(X t ) = 1; 

(3) Feller's square-root diffusion process, A{X t ) = k(/j, — X t ), B{X t ) = \fX~ t \ 

(4) Fractional Brownian motion with Hurst parameter H S (0, 1). 

In each case we simulated 10,000 sample paths of the process in question, applied the tests to 
either the subcrossing data or up/down symmetry data, and report on our results in several 
ways. Graphically, we show the percentage out of 10,000 that was significant at the 5% level 
with our tests of distribution and our tests of independence. We also report results in tables. 
At higher levels, the length of the data drops below the length we feel it appropriate to apply 
a test. In such cases we report the percentage that was significant, both out of 10,000 and 
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Brownian Motion Distr. Test Results 



Brownian Motion Indep. Test Results 




Figure 5. Crossing tree results for standard Brownian Motion. Percentage 
of 10,000 sample paths rejected, with an average of 1250 crossings by time 5. 
Rejection using distribution tests (left) and independence tests (right). Since 
the null hypothesis is satisfied, 5% rejection is expected for all tests. Here, 95% 
confidence intervals are of the order ±0.4%. 



a = 0, n = 1250, S = l/(5\/T0) 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


X 2 (+2 df) 




5.2 


4.9 


3.4 ( 3.7; 9395) 


0.0 ( NaN; 0) 


Twos Test 




4.3 


4.0 


2.9 


1.2 ( 1.2; 9969) 


G Test 




5.3 


5.7 


5.7 


4.6 ( 4.6; 9969) 


KS Discrete 




4.7 


4.4 


3.7 


2.7 


KLP98 Test 




4.5 


3.4 


2.0 


0.0 


Joint Dist. 




4.9 


4.7 


4.7 ( 4.7; 9963) 


0.0 ( 0.0; 36) 


Autocorr. 




4.3 


5.0 


4.9 


2.2 ( 4.8; 4637) 


Runs Test 




4.5 


3.9 


2.6 


0.0 


Larsen Test 




4.7 


5.2 


5.1 ( 5.1; 9995) 


0.7 ( 0.8; 8782) 


Dix.-OBri. 




5.5 


5.6 


4.5 


0.0 ( 0.0; 9969) 


OBri85 




5.4 


5.5 


3.8 ( 4.8; 7970) 


0.0 ( 0.0; 10) 


Runs UD 


4.5 


4.0 


2.5 


0.0 


0.0 


Larsen UD 


5.3 


5.1 


5.0 ( 5.0; 9999) 


0.8 ( 0.8; 9185) 


0.0 ( 0.0; 4208) 


Dix.-OBri. UD 


5.5 


5.5 


4.3 


0.1 ( 0.1; 9963) 


0.0 ( 0.0; 7021) 


OBri85 UD 


5.3 


5.3 


4.3 ( 5.5; 7894) 


0.0 ( 0.0; 77) 


0.0 ( NaN; 0) 



Table 1. Crossing tree results for standard Brownian Motion. Percentage of 
10,000 sample paths rejected, with an average of 1250 crossings by time 5. Since 
the null hypothesis is satisfied, 5% rejection is expected for all tests. At higher 
levels, insufficient data length means some datasets are not tested. 



out of the number tested, along with the number of datasets tested. We do this to distinguish 
between, say, 0% rejected out of 13 tested and 0% rejected out of 9727 tested. The latter 
seems deserving of more weight in our opinion. Park and Vasudev used simulation to test their 
quadratic variation method on processes 1-3 above, using 10,000 sample paths each. So we 
can do direct co mparisons with their results, and with our implementation of the quadratic 
variation method. IPeters and de Vilderl ( 20061 ) do not provide power results. 
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Since the quadratic variation method uses points regularly spaced in time, and the crossing 
tree uses crossi ngs, there is a question of what is a 'direct' comparison. For example, Park and 
Vasudev (|2005T ) simulate processes with regularly spaced times of interval length 1/250 up to 
time 5, so 1250 points, which they argue represents daily information for 5 years. We are free to 
set the scale 5 of the crossing tree, and we determined a value so that, on average, the number 
of crossings in a time interval is the same as the number of points in the quadratic variation 
method in a time interval of the same length. So, we would compare 1250 points using quadratic 
variation with 1250 crossings using the crossing tree, with all parameters of the process kept 
constant. How we determine 5 to do this for various processes is described in Appendix [Bj 



4.1. Brownian motion with drift. Brownian Motion with drift is an example of a process 
that does not satisfy property CLM1 of Corollary [H For drift 1 and 1.5, 1250 crossings were 
simulated 10,000 times each. Figure [6] shows the results for drift 1, for which 5 = 0.06328774784 
was used. 

Of the distribution tests (left panel), only the G test rejects at a noteworthy level relative 
to the 5% level of the tests. The test of Kyriakoussis et al. shows quickly dimininshing power 
as level increases. This is likely because the test statistic is based on an asymptotic Gaussian 
distribution. This is an increasingly poor approximation as the level increases, and so the 
number of subcrossings decreases. 

Amongst the non-up-down independence tests (right panel), rejection between 4% and 6% is 
most common with the autocorrelation test showing rejection around 7.3% at levels 3 and 4. 
So the autocorrelation test demonstrates the most power of these non-up-down independence 
tests. For the up-down symmetry tests, we expect up-down as often as down-up, and each pair 
to be independent of all others because of the independent increments of the process. Thus 
5% rejection, as with standard Brownian motion, would be expected. At higher levels the drift 
dominates the variability in the path. 

Figure [8] (left) shows results from our implementation of the quadratic variation method, 
applied to Brownian motion with drift 1. Clearly the rejection rates, in the range 30-60% are 
larger for this method, but an exact number is difficult to identify. The results for this type of 
process are also unique among all those we've seen in that the standardized mean test shows 
higher power than the Cramer- von Mises test. This is not surprising since the increments being 
tested are being tested as mean 0, although they are not. Park and Vasudev report rejection 
rates of 47% (KS), 56% (CV), and 60% (SM), which are similar to ours, and have the same 
ordering from smallest to largest. Clearly, the quadratic variation method shows higher power 
than the crossing tree at this length. 

Figure [7] shows results for Brownian motion with drift a = 1.5. With larger drift, the smaller 
value 5 = 0.06334057822 gives an average of 1250 crossings by time 5. The larger drift term 
provides some clear differences. Rejection rates from the distribution tests are generally higher. 
In particular, the discrete Kolmogorov-Smirnov and Twos test show rejection around 15.5% 
at level 4, about 10% higher than for drift 1. With the Twos test in particular, the result is 
easily explained. With a stong drift component there will be many crossings with only two 
subcrossings and no excursions to give additional crossings. Thus, 2 subcrossings will occur 
more than 50% of the time (the expected rate under the null hypothesis). As with drift 1, the 
power of the test by Kyriakoussis et al. isn't really comparable at higher levels. 

Amongst the independence tests (right panel), the autocorrelation test is striking that it 
rejects 20% at level 4. But, as with drift 1, these tests still show low power compared to the 
quadratic variation method. With our implementation, 89-91% of paths are rejected. Park 
and Vasudev reported identical rejection rates: 82% (KS), 89% (CV) and 91% (SM). So, the 
quadratic variation method again shows higher power to reject Brownian motion with drift. 

Of all the processes we consider here, the higher power of the quadratic variation method over 
the crossing tree is particular to Brownian motion with drift. We did additional simulations 
(not shown) to explore the crossing tree further for this process. Again using drift 1, and the 
same corresponding value of 5 for the crossing tree, datasets with more crossings were simulated. 
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Especially for these two explorations, only 1000 datasets were used, providing lengths of either 
10,000 crossings or 20,000 crossings. So these are 8 and 16 times longer, respectively, than for 
the results previously described. The additional length provides more data at lower levels, and 
additional levels at which to apply tests. For length 10,000, the autocorrelation test rejected 
83% at level 5 testing 998 paths. The Twos test, the x 2 test an d the KS discrete test rejected 
about 67%. With 20,000 crossings those tests rejected over 95% at level 5. While higher levels 
were not tested, it is likely that there was sufficient length to test level 6, with higher rejection 
rates again. 




Figure 6. Crossing tree results for Brownian Motion with drift a = 1. Percent- 
age of 10,000 sample paths rejected, with an average of 1250 crossings by time 
5. Rejection using distribution tests (left) and independence tests (right). Here, 
95% confidence intervals are 1% or smaller. 



a = l,n = 1250, 5 = 0.06328774784 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


5 


x 2 (+2 df) 




4.7 


4.2 


4.6 ( 4.7; 9721) 


0.0 (100.0; 2) 


0.0 ( NaN; 0) 


Twos Test 




4.5 


4.3 


3.9 


5.1 ( 5.1; 9998) 


0.0 ( 0.0; 9397) 


G Test 




5.3 


5.6 


7.0 


10.5 ( 10.5; 9998) 


0.8 ( 0.8; 9397) 


KS Discrete 




4.8 


4.5 


4.4 


5.4 


0.0 


KLP98 Test 




4.5 


3.5 


1.9 


0.0 


0.0 


Joint Dist. 




4.7 


4.7 


4.0 ( 4.0; 9998) 


0.0 ( 1.0; 289) 


0.0 ( NaN; 0) 


Autocorr. 




4.3 


4.9 


7.3 


7.4 ( 10.0; 7384) 


0.0 ( 0.0; 7) 


Runs Test 




4.3 


4.1 


2.6 


0.0 


0.0 


Larsen Test 




5.0 


5.6 


4.9 ( 4.9; 9999) 


1.8 ( 1.9; 9660) 


0.0 ( 0.0; 8507) 


Dix.-OBri. 




5.7 


5.2 


4.7 


0.0 ( 0.0; 9998) 


0.0 ( 0.0; 9397) 


OBri85 




5.3 


5.5 


4.9 ( 5.9; 8358) 


0.0 ( 0.0; 41) 


0.0 ( NaN; 0) 


Runs UD 


4.5 


3.5 


2.5 


0.0 


0.0 


0.0 


Larsen UD 


5.0 


4.9 


4.8 


0.4 ( 0.4; 8683) 


0.0 ( 0.0; 2410) 


0.0 ( 0.0; 135) 


Dix.-OBri. UD 


5.4 


4.7 


4.1 


0.0 ( 0.0; 9903) 


0.0 ( 0.0; 4416) 


0.0 ( 0.0; 256) 


OBri85 UD 


5.7 


5.4 


4.1 ( 5.3; 7675) 


0.0 ( 0.0; 19) 


0.0 ( NaN; 0) 


0.0 ( NaN; 0) 



Table 2. Crossing tree results for Brownian Motion with drift a = 1. Percent- 
age of 10,000 sample paths rejected, with an average of 1250 crossings by time 
5. At higher levels, insufficient data length means some datasets are not tested. 
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Brownian Motion with Drift 1.5 Distr. Test Results 



X (2 extra df) 

- ■ - 1 Twos Test 

G Test 

-e — KS Discrete 
KLP98 



Brownian Motion with Drift 1.5 Indep. Test Results 



-Joint Dist. 

■ Autocorr. 
Runs Test 
Larsen 

Dixon-O'Brien 
0'Brien85 

- Runs Test UpDown 

■ Larsen UpDown 

- Dixon-O'Brien UpDown 

■ 0'Brien85 UpDown 





Figure 7. Crossing tree results for Brownian Motion with drift a = 1.5. Per- 
centage of 10,000 sample paths rejected, with an average of 1250 crossings by 
time 5. Rejection using distribution tests (left) and independence tests (right). 
Here, 95% confidence intervals are 1% or smaller. 



a = 1.5, n = 1250, 8 = 0.06334057822 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


5 


X 2 (+2 df) 




4.6 


3.5 


7.9 ( 8.0; 9924) 


0.2 (100.0; 18) 


0.0 ( NaN; 0) 


Twos Test 




4.6 


5.2 


7.2 


15.4 


1.0 ( 1.0; 9923) 


G Test 




5.7 


6.6 


11.7 


25.6 


7.2 ( 7.3; 9923) 


KS Discrete 




4.9 


5.8 


7.6 


15.5 


1.0 


KLP98 Test 




4.3 


3.0 


1.9 


0.1 


0.0 


Joint Dist. 




4.6 


4.6 


3.9 ( 3.9; 9999) 


0.6 ( 4.3; 1464) 


0.0 ( NaN; 0) 


Autocorr. 




4.4 


4.9 


11.9 


19.7 ( 22.3; 8853) 


0.0 ( 1.7; 58) 


Runs Test 




4.0 


4.0 


2.8 


0.1 


0.0 


Larsen Test 




5.2 


5.5 


5.1 


3.3 ( 3.3; 9921) 


0.0 ( 0.0; 9708) 


Dix.-OBri. 




5.5 


5.7 


5.1 


0.2 


0.0 ( 0.0; 9923) 


OBri85 




5.2 


5.2 


4.9 ( 5.7; 8550) 


0.0 ( 0.0; 69) 


0.0 ( NaN; 0) 


Runs UD 


4.2 


3.7 


2.4 


0.0 


0.0 


0.0 


Larsen UD 


5.1 


5.0 


4.7 ( 4.7; 9998) 


0.1 ( 0.1; 8179) 


0.0 ( 0.0; 1416) 


0.0 ( 0.0; 37) 


Dix.-OBri. UD 


5.6 


5.0 


3.9 


0.0 ( 0.0; 9722) 


0.0 ( 0.0; 2666) 


0.0 ( 0.0; 58) 


OBri85 UD 


5.7 


5.9 


3.1 ( 4.3; 7210) 


0.0 ( 0.0; 5) 


0.0 ( NaN; 0) 


0.0 ( NaN; 0) 



Table 3. Crossing tree results for Brownian Motion with drift a = 1.5. Per- 
centage of 10,000 sample paths rejected, with an average of 1250 crossings by 
time 5. At higher levels, insufficient data length means some datasets are not 
tested. 



4.2. Ornstein-Uhlenbeck process. For the Ornstein-Uhlenbeck process 

dX{t) = -aX{t)dt + adW(t) 
with a, a > we considered a = 1 and both a = 8 and a = 10. Park and Vasudev (|2005l ) argue 



that these values of a (relative to a) are reasonable for short term interest rate data. This 
process has a stationary N(0, a 2 /(2a)) distribution. In all cases it was the stationary process 
we worked with. 

First we simulated 1250 crossings of the process with a = 8, a = 1. The value 5 = 0.063015 
was determined to give an average of 1250 crossings in time 5, as desired. Figure [9] (left) shows 
the results from the distribution tests. The x 2 test with two extra degrees of freedom is most 
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Figure 8. Rejection rates using sample quadratic variation for 10,000 sample 
paths of length 1250 for standard Brownian motion with drift 1 (left) and 1.5 
(right), at times spaced every 1/250. Here, 95% confidence intervals are 1% or 
smaller. 



powerful, rejecting over 11% at level 2. The lack of power at level 3 appears to be an artifact of 
data length relative to our cutoffs. We do not apply the x 2 test to datasets shorter than length 
14, even with our use of empirical critical values. The KLP98 test does much better with this 
process, rejecting almost 9% by level 3. Interestingly, the discrete KS test rejects at about 8%, 
but at level 2, suggesting it is truly sensitive to different features than KLP98. 

Figure [9] (right) shows the results from the independence tests. Here the joint distribution 
test dominates, rejecting over 14% at level 3 while the other tests exceed their significance level 
of 5% barely at all. The Runs test is also noteworthy since it's rejection rate doesn't fall quickly, 
as it does with Brownian motion and Brownian motion with drift. In fact, it is superior to the 
other three "classical" tests at level 3. 

An additional comment on the joint distribution test is in order. That test makes an assump- 
tion about independence and an assumption about the marginal distribution of the number of 
subcrossings. Considering the large rejection rates from the distribution tests, it could be a 
violation of the distribution assumption that causes the joint distribution test to reject so many 
paths. However, when we permute the subcrossing data {Z l k } we find the joint distribution test 
then rejects about 4% of all paths at level 3. So 10% of the paths are rejected by the joint 
distribution test due to bivariate dependence in the data. Phrased another way, of the paths 
rejected by the joint distribution test, almost 70% are rejected due to bivariate dependence. At 
levels one and two, the results after permuting are virtually indistiguishable from those without 
permuting, and reject about 5%, which is the level of the test. These results were consistent for 
all the OU processes we studied. 

These results justify rejecting 11-15% of the sample paths. By comparison, results from our 
implementation of the quadratic variation method are shown in Figure [15j Rejection rates are 
around 0.7% (KS) and 1% (CVM). The standardized mean (SM) test consistently has no power. 
So the cr ossing tree method appears to reject 10-15 times as many sample paths. Park and 
Vasudev ( 20051 ) reported no results for length 1250, except to say the test had "low power". 
This is not surprising since their implementation compromised the power by searching for the 
time interval length that makes the transformed data most like A r (0, 1). 

Figure [11] shows results for the same Ornstein-Uhlenbeck process, but with 5000 crossings. 
The additional length provides longer datasets for our tests, and the result is most noticeable 
at level three. As with the shorter datasets, the x 2 test rejects the most paths, followed by the 
KLP98 test. But with the longer datasets, considerably more paths are rejected. With our x 2 
test, 73% are rejected, compared to just 11% with 1250 crossings. From the independence tests, 
the joint distribution test again dominates, even more so now, rejecting almost 87%. The Runs 
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test again shows the most power of the classical "independence" tests, although at 23% all of 
the distribution tests except for the Twos test are more powerful. 

The results for 5000 crossings suggest the crossing tree rejects around 73-87% of the sample 
paths. Again, this is better than the quadratic variation method. Figure[T6l(left) shows rej ection 
rates of about 47% (KS) and 67% (CVM) with our implemention. Park and Vasudev ( 20051 ) 
reported even lower results: 13% (KS) and 24% (CVM). 

For comparison we also simulated crossings from the Ornstein-Uhlenbeck process with a = 10, 
hence a stronger mean reversion effect. The scale parameter for the crossing tree was found 
to be 0.062945. Figure [10] shows the results are generally comparable, with slightly higher 
rejection rates, to those for a = 8. Again, amongst the distribution tests, our x 2 test rejects the 
most while the KLP98 test rejects the secondmost, again at levels two and three respectively. 
Amongst the independence tests the joint distribution test again rejects considerably more 
than the others. One noteworthy difference from the shorter dataset is that now the Runs 
test distinguishes itself with noticeably more power than the other "classical" tests at level 3. 
Primarily from the x 2 an d joint distribution tests, the crossing tree method rejects about 15% 
of the sample paths. Figure [15] (right) shows with our implemenation of the quadratic variation 
method rejection rates of only 1.2% (KS) and 1.8% (CVM). Again, Park and Vasudev reported 
no results for this length. 

Figure [12] shows results for 5000 crossings and a = 10 are comparable to those for 1250 
crossings. From the distribution tests, our x 2 test rejects the most, and the KLP98 test rejects 
the second most. Amongst the independence tests, the joint distribution test rejects the most 
while the Runs test rejects the second most. Again, with additional crossings the rejection 
rates are much higher, at around 97% for the joint distribution test and 78% for our x 2 test. 
Again these rates are higher (but generally comparable) to those from our implementation of 
the quadratic variation method. Figure [TU1 (right) shows rejection rates of 69% (KS) and 87% 
(CVM). Again, our quadratic variation rates are higher than those of Park and Vasudev, who 
reported rates of 31% (KS) and 52% (CVM). 

Finally, we simulated the process with a = 1, a = 1, n = 5000 to investigate a weaker mean 
reversion effect. Results are shown in Figure Q3] and Table [9] Because the mean reversion 
is weaker, it is much harder to distinguish this alternative from Brownian motionn and the 
rejection rates are much closer to their signficance level of 5%.. At level 3 our distribution and 
joint distribution tests reject about 7%. Figure [T71 shows results for our implementation of the 
quadratic variation-based method. Neither test performs well, with rejections rates 0.17% (KS) 
and 0.12%(CVM) well below 1%. Interestingly, for these parameters rejection rates are higher 
with shorter (n = 1250) datasets, unlike our other results using -sample variation. 

In summary, for the Ornstein-Uhlenbeck processes considered here, the crossing tree method 
shows higher power. With strong mean reversion (a = 8 or 10, a = 1), for length 1250 the 
crossing tree method rejects between 8 and 15 times more paths. For length 5000 the difference 
is less dramatic, but an additional 10% are rejected. With stronger mean reversion (a = 1, 
a = 1) the difference is more dramatic, with the crossing tree method rejection around 40 times 
as many paths. 
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Figure 9. Crossing tree results for the Ornstein-Uhlenbeck process with a = 8, 
a = 1, n = 1250, 5 = 0.063015. Percentage of 10,000 sample paths rejected using 
distribution tests (left) and independence tests (right). Here, 95% confidence 
intervals are 1% or smaller. 



a = 8,a = l,n = 1250, S = 0.063015 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


X 2 (+2 df) 




7.5 


11.0 


0.7 ( 1.8; 4110) 


0.0 ( NaN; 0) 


Twos Test 




5.0 


5.6 


1.3 


0.0 ( 0.0; 325) 


G Test 




6.1 


7.3 


6.8 


0.0 ( 0.0; 325) 


KS Discrete 




5.8 


7.7 


5.6 


0.1 


KLP98 Test 




4.7 


4.4 


8.8 


0.0 


Joint Dist. 




5.7 


7.7 


14.5 ( 17.7; 8210) 


0.0 ( NaN; 0) 


Autocorr. 




4.5 


4.9 


5.8 ( 5.8; 9908) 


0.0 ( NaN; 0) 


Runs Test 




4.2 


4.2 


6.0 


0.0 


Larsen Test 




5.0 


5.0 


2.4 ( 2.4; 9965) 


0.0 ( 0.0; 308) 


Dix.-OBri. 




5.5 


5.3 


1.5 


0.0 ( 0.0; 325) 


OBri85 




5.3 


4.9 ( 4.9; 9999) 


4.5 ( 11.9; 3820) 


0.0 ( NaN; 0) 


Runs UD 


4.6 


3.6 


3.1 


0.2 


0.0 


Larsen UD 


5.1 


5.2 


5.5 


1.7 ( 1.9; 9144) 


0.0 ( 0.0; 142) 


Dix.-OBri. UD 


5.4 


5.4 


5.8 


0.3 ( 0.3; 9941) 


0.0 ( 0.0; 290) 


OBri85 UD 


5.3 


5.5 


5.1 ( 6.0; 8538) 


0.0 ( 0.0; 194) 


0.0 ( NaN; 0) 



Table 4. Crossing tree results for the Ornstein-Uhlenbeck process with a = 8, 
a = 1, n = 1250, 5 = 0.063015. Percentage of 10,000 sample paths rejected, 
with an average of 1250 crossings by time 5. At higher levels, insufficient data 
length means some datasets are not tested. 
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Figure 10. Crossing tree results for the Ornstein-Uhlenbeck process with a = 8, 
a = 1, n = 5000, 5 = 0.063015. Percentage of 10,000 sample paths rejected using 
distribution tests (left) and independence tests (right). Here, 95% confidence 
intervals are 1% or smaller. 



a = 8, a = 1, n = 5000, 5 = 0.063015 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


X 2 (+2 df) 




10.5 


25.0 


72.6 


0.0 ( NaN; 0) 


Twos Test 




7.7 


12.7 


3.5 


0.0 ( 0.0; 1340) 


G Test 




7.3 


16.1 


32.5 


0.0 ( 0.0; 1340) 


KS Discrete 




9.1 


18.1 


29.5 


0.2 


KLP98 Test 




4.9 


6.0 


45.5 


0.0 


Joint Dist. 




7.8 


11.8 


87.0 


0.0 ( NaN; 0) 


Autocorr. 




5.1 


5.2 


13.0 


0.0 ( 0.0; 2) 


Runs Test 




4.7 


4.8 


23.1 


0.0 


Larsen Test 




4.7 


4.9 


3.3 


0.0 ( 0.0; 1314) 


Dix.-OBri. 




5.3 


5.9 


1.4 


0.0 ( 0.0; 1340) 


OBH85 




5.1 


5.6 


16.6 ( 16.8; 9851) 


0.0 ( NaN; 0) 


Runs UD 


4.9 


4.3 


4.3 


5.2 


0.0 


Larsen UD 


5.1 


5.0 


6.0 


7.3 ( 7.3; 9988) 


0.0 ( 0.0; 662) 


Dix.-OBri. UD 


5.2 


5.5 


6.9 


7.4 


0.0 ( 0.0; 1303) 


OBri85 UD 


5.4 


5.4 


5.9 


4.7 ( 5.6; 8298) 


0.0 ( NaN; 0) 



Table 5. Crossing tree results for the Ornstein-Uhlenbeck process with a = 8, 
a = 1, n = 5000, 5 = 0.063015. Percentage of 10,000 sample paths rejected, 
with an average of 5000 crossings by time 20. At higher levels, insufficient data 
length means some datasets are not tested. 
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Figure 11. Crossing tree results for the Ornstein-Uhlenbeck process with 
a = 10, a = 1, n = 1250, 5 = 0.062945. Percentage of 10,000 sample paths 
rejected using distribution tests (left) and independence tests (right). Here, 95% 
confidence intervals are 1% or smaller. 



a = 10, a = 1, n = 1250, 5 = 0.062945 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


X 2 (+2 df) 




8.7 


13.7 


0.4 ( 1.6; 2275) 


0.0 ( NaN; 0) 


Twos Test 




5.2 


6.2 


0.7 ( 0.7; 9988) 


0.0 ( 0.0; 58) 


G Test 




6.2 


8.5 


6.4 ( 6.4; 9988) 


0.0 ( 0.0; 58) 


KS Discrete 




6.3 


9.3 


6.3 


0.0 


KLP98 Test 




4.6 


4.9 


11.3 


0.0 


Joint Dist. 




6.0 


8.7 


15.6 ( 24.4; 6409) 


0.0 ( NaN; 0) 


Autocorr. 




4.6 


4.7 


5.0 ( 5.3; 9577) 


0.0 ( NaN; 0) 


Runs Test 




4.5 


4.0 


7.2 


0.0 


Larsen Test 




5.2 


5.2 


1.7 ( 1.7; 9923) 


0.0 ( 0.0; 56) 


Dix.-OBri. 




5.6 


5.3 


0.6 ( 0.6; 9988) 


0.0 ( 0.0; 58) 


OBri85 




5.2 


5.2 ( 5.2; 9999) 


3.3 ( 14.6; 2274) 


0.0 ( NaN; 0) 


Runs UD 


4.7 


3.9 


3.1 


0.1 


0.0 


Larsen UD 


5.1 


5.4 


6.0 


1.1 ( 1.2; 9004) 


0.0 ( 0.0; 27) 


Dix.-OBri. UD 


5.9 


5.5 


6.2 


0.1 ( 0.1; 9906) 


0.0 ( 0.0; 54) 


OBH85 UD 


5.5 


5.5 


4.6 ( 5.4; 8617) 


0.0 ( 0.0; 80) 


0.0 ( NaN; 0) 



Table 6. Crossing tree results for the Ornstein-Uhlenbeck process with a = 10, 
a = 1, n = 1250, 5 = 0.062945. Percentage of 10,000 sample paths rejected, 
with an average of 1250 crossings by time 5. At higher levels, insufficient data 
length means some datasets are not tested. 
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OU Distr. Test Results 
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Figure 12. Crossing tree results for the Ornstein-Uhlenbeck process with 
a = 10, a = 1, n = 5000, 5 = 0.062945. Percentage of 10,000 sample paths 
rejected using distribution tests (left) and independence tests (right). Here, 95% 
confidence intervals are 1% or smaller. 



a = 10, a = 1, n = 5000, S = 0.062945 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


X 2 (+2 df) 




12.7 


34.3 


77.5 


0.0 ( NaN; 0) 


Twos Test 




9.8 


16.1 


1.3 


0.0 ( 0.0; 250) 


G Test 




8.9 


22.9 


45.6 


0.0 ( 0.0; 250) 


KS Discrete 




11.7 


24.9 


52.3 


0.0 


KLP98 Test 




4.8 


7.3 


64.3 


0.0 


Joint Dist. 




9.3 


16.3 


97.4 


0.0 ( NaN; 0) 


Autocorr. 




5.0 


5.2 


11.6 


0.0 ( 0.0; 1) 


Runs Test 




4.7 


4.6 


47.2 


0.0 


Larsen Test 




5.2 


5.0 


1.9 


0.0 ( 0.0; 250) 


Dix.-OBri. 




5.8 


5.5 


0.7 


0.0 ( 0.0; 250) 


OBri85 




5.2 


5.8 


26.4 ( 28.2; 9374) 


0.0 ( NaN; 0) 


Runs UD 


4.6 


4.3 


5.3 


3.6 


0.0 


Larsen UD 


4.9 


5.3 


6.3 


6.1 ( 6.1; 9970) 


0.0 ( 0.0; 117) 


Dix.-OBri. UD 


5.2 


6.0 


8.3 


5.5 


0.0 ( 0.0; 246) 


OBri85 UD 


5.4 


5.7 


6.5 


4.0 ( 5.1; 7809) 


0.0 ( NaN; 0) 



Table 7. Crossing tree results for the Ornstein-Uhlenbeck process with a = 10, 
a = 1, n = 5000, 5 = 0.062945. Percentage of 10,000 sample paths rejected, 
with an average of 5000 crossings by time 20. At higher levels, insufficient data 
length means some datasets are not tested. 
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OU Distr. Test Results 



OU Indep. Test Results 
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FIGURE 13. Crossing tree results for the Ornstein-Uhlenbeck process with a = 1, 
a = 1, n = 1250, 5 = 0.063220. Percentage of 10,000 sample paths rejected using 
distribution tests (left) and independence tests (right). Here, 95% confidence 
intervals are 1% or smaller. 



a = 1, a = 1, n = 1250, 5 = 0.063220 





% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


X 2 (+2 df) 




5.5 


5.8 


3.2 ( 3.6; 8964) 


0.0 ( NaN; 0) 


Twos Test 




4.4 


4.0 


2.8 


0.6 ( 0.6; 9794) 


G Test 




5.1 


5.9 


5.9 


3.1 ( 3.2; 9794) 


KS Discrete 




4.6 


4.9 


4.1 


3.7 


KLP98 Test 




4.3 


3.5 


1.8 


0.0 


Joint Dist. 




5.2 


5.2 


5.2 ( 5.3; 9953) 


0.0 ( 0.0; 16) 


Autocorr. 




5.0 


5.1 


3.8 


1.2 ( 4.0; 3044) 


Runs Test 




5.1 


3.6 


2.5 


0.0 


Larsen Test 




5.2 


5.0 


4.6 ( 4.6; 9992) 


0.3 ( 0.3; 8063) 


Dix.-OBri. 




5.5 


5.1 


4.2 


0.0 ( 0.0; 9794) 


OBri85 




5.6 


5.5 


4.2 ( 5.5; 7632) 


0.0 ( 0.0; 5) 


Runs UD 


4.7 


3.9 


2.8 


0.0 


0.0 


Larsen UD 


5.4 


5.0 


5.0 ( 5.0; 9999) 


0.9 ( 1.0; 9274) 


0.0 ( 0.0; 4678) 


Dix.-OBri. UD 


5.6 


5.5 


4.6 


0.1 ( 0.1; 9984) 


0.0 ( 0.0; 7612) 


OBri85 UD 


5.7 


5.9 


4.4 ( 5.5; 7962) 


0.0 ( 0.0; 69) 


0.0 ( NaN; 0) 



Table 8. Crossing 
a = 1, n = 1250, 
with an average of 
length means some 



; tree results for the Ornstein-Uhlenbeck process with a = 1, 
5 = 0.063220. Percentage of 10,000 sample paths rejected, 
1250 crossings by time 5. At higher levels, insufficient data 
datasets are not tested. 
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Figure 14. Crossing tree results for the Ornstein-Uhlenbeck process with a = 1, 
a = 1, n = 5000, 5 = 0.063220. Percentage of 10,000 sample paths rejected using 
distribution tests (left) and independence tests (right). Here, 95% confidence 
intervals are 1% or smaller. 



a = 1, a = 1, n = 5000, S = 0.063220 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


X 2 (+2 df) 




5.5 


6.6 


7.3 


2.4 ( 3.2; 7642) 


Twos Test 




4.7 


4.7 


4.4 


2.6 


G Test 




5.2 


5.5 


6.0 


6.2 


KS Discrete 




5.1 


4.9 


5.4 


5.9 


KLP98 Test 




4.9 


4.4 


3.7 


3.1 


Joint Dist. 




5.7 


5.1 


6.2 


6.9 ( 7.1; 9725) 


Autocorr. 




5.1 


4.6 


4.9 


3.0 ( 3.0; 9999) 


Runs Test 




4.6 


4.6 


4.3 


2.5 


Larsen Test 




5.3 


5.1 


4.9 


4.3 ( 4.3; 9989) 


Dix.-OBri. 




5.1 


5.8 


5.4 


3.5 


OBH85 




5.3 


5.7 


5.3 


3.9 ( 5.9; 6624) 


Runs UD 


4.8 


4.4 


4.0 


2.8 


0.1 


Larsen UD 


5.3 


4.9 


5.0 


5.0 ( 5.0; 9999) 


1.7 ( 1.8; 9319) 


Dix.-OBri. UD 


5.5 


5.8 


5.2 


4.5 


0.2 ( 0.2; 9968) 


OBH85 UD 


5.2 


5.3 


5.5 


4.6 ( 5.4; 8392) 


0.0 ( 0.7; 295) 



Table 9. Crossing tree results for the Ornstein-Uhlenbeck process with a = 1, 
a = 1, n = 5000, 5 = 0.063220. Percentage of 10,000 sample paths rejected, 
with an average of 5000 crossings by time 20. At higher levels, insufficient data 
length means some datasets are not tested. 
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Figure 15. Rejection rates using sample quadratic variation for Ornstein- 
Uhlenbeck processes. Short (n = 1250) datasets. Left has drift coefficient a = 8, 
right has drift coefficient a = 10. Here, 95% confidence intervals are < 1%. 




Figure 16. Rejection rates using sample quadratic variation for Ornstein- 
Uhlenbeck processes. Long (n = 5000) datasets. Left has drift coefficient a = 8, 
right has drift coefficient a = 10. Here, 95% confidence intervals are < 1%. 
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Figure 17. Rejection rates using sample quadratic variation for Ornstein- 
Uhlenbeck processes with drift coefficient a = 1. Left panel uses short (n = 1250) 
datasets, right panel uses long (n = 5000) datasets. Here, 95% confidence inter- 
vals are 1% or smaller. 
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4.3. Feller's Square-root Diff usion. Anot her example of a mean reverting process in use 
i s Feller's square root diffusion ( Feller . 195ll ). which underlies the Cox-Ingersoll-Ross model 



(|Cox et all Il985l ). It is described by 



dX(t) = K(fi - X(t))dt + <jy/X(t)dW(t) 

for K,fi,a > 0, 2nfj,/a 2 > 1. The stationary distribution is Gamma(a, b) with shape parameter 
a = 2Kfi/a 2 and scale parameter b = a 2 /(2k). We assume the stationary process throughout. 
We again considered two lengths (1250 and 5000) and two parameter values (k = 6 and k = 8), 
with a = 1 and [i = 0.2. Note that the mean is non-zero for this process. 

To simulate 1250 crossings with k = 6, the value 5 = 0.028163 was used for the size of the 
level crossings. Figure [TH] (left) shows the results from our distribution tests. As with the 
Ornstein-Uhlenbeck process, our x 2 test seems to have the most power, rejecting almost 19% 
at level 2. At level 3 it again rejects less than other distribution-based tests due to insufficient 
data length. The KS discrete and G test are comparable at level 2, with the former showing 
more power at level 3, although almost all the paths were tested using both tests. For this 
process, the KLP98 test seems to have little power. 

Figure [18] (right) shows the results from our independence tests. As with the Ornstein- 
Uhlenbeck process, the joint distribution test appears to have the most power, although its 
power does not dominate in these tests. In fact, its power appears to exceed only the Twos test 
and the KLP98 test amongst our distribution tests. The other independence tests don't really 
reject beyond their significance level. 

These results suggest 11-19% of the sample paths are rejected. As with the Ornstein- 
Uhlenbeck process, this is much better than with the quadratic variation method. Figure 
[22] shows based on our implementation we would reject about 0.3% with both the KS and CVM 
tests. Park and Vasudev report no results for length 1250 due to low power. 

Figure [19] shows results from the same process, but with 5000 crossings simulated using 
5 = 0.028474. With the additional length, more paths are rejected. The left panel shows, again, 
our x 2 test rejects the most, 72% at level 3. The KS discrete test is not far behind, rejecting 65% 
at level3. From our independence tests, again the joint distribution test dominates, rejecting 
about 67% at level 3. At level 2 its power is less than all but the KLP98 test. 

The Dixon-O'Brien symmetry test on the updown pairs (Dix.-OBri. UD) is noteworthy here, 
in that it rejects about 15% of the paths at level 2. This test rejects a path if the more numerous 
symbol in the binary sequence (either or 1, representing up-down and down-up pairs), shows 
clustering. This is consistent with a mean reverting process in that the first crossing of an 
up-down pair is more likely to be in the direction of the mean. And so, amongst only down-up 
and up-down pairs, clustering is consistent with a path that favors return to the mean. 

In summary, for this process with these parameters our results show 62-67% of the paths 
are rejected. Our implementation of the quadratic variation method (Figure I2"3"j) suggests that 
method rejects from 14% (KS) to 24% (CVM) of paths. So the crossing tree method appears 
more powerful, rejecting an additional 48% of the paths. Park and Vasudev again reported 
smaller rejection rates: 4% (KS) and 9% (CVM). 

For comparison we also used drift coefficient k = 8. Figure [20] shows our results for 1250 
simulated crossings using 5 = 0.028276. The results are qualitatively similar to those for k = 6 
and the same length. This was also seen with the Ornstein-Uhlenbeck process when the drift 
coefficient was increased. A difference worth noting is that at level 3 the KS discrete test rejects 
about 10% more paths than the x 2 tests. This is a reversal of usual results in which the x 2 
test shows the highest power among the tests of distribution. In summary, for this process the 
crossing tree would reject 15-18% of the paths. Figure [22] (right) shows, with our quadratic 
variation implemenation, only 0.6-0.7% of paths are rejected, continuing the results for k = 6 
showing increased power from the crossing tree. 

Finally, with drift coefficient k = 8 we also simulated 5000 crossings using 5 = 0.028330. 
Figure [21] shows our results, which are similar to those for k = 6 with length 5000. The power 
of the joint distribution tests again dominates. The Dixon-O'Brien symmetry test (Dix.-OBri. 
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Figure 18. Crossing tree results for Feller's square root diffusion process with 
K = 6, a = 1, n = 0.2, n = 1250, 5 = 0.028163. Percentage of 10,000 sample 
paths rejected using distribution tests (left) and independence tests (right). Here, 
95% confidence intervals are 1% or smaller. 



k = 6, fi = 0.2, a = l,n = 1250, 5 = 0.028163 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


X 2 (+2 df) 




8.9 


18.5 


0.4 ( 2.5; 1731) 


0.0 (100.0; 1) 


Twos Test 




5.3 


7.8 


3.0 ( 3.0; 9967) 


0.1 ( 0.5; 1276) 


G Test 




6.7 


11.3 


11.9 ( 12.0; 9967) 


0.1 ( 0.6; 1276) 


KS Discrete 




6.2 


12.7 


14.6 


0.6 


KLP98 Test 




4.7 


6.3 


6.4 


0.0 


Joint Dist. 




6.0 


10.7 


6.2 ( 11.9; 5243) 


0.0 (100.0; 1) 


Autocorr. 




4.9 


4.9 


1.5 ( 1.6; 9134) 


0.0 ( 7.7; 52) 


Runs Test 




4.5 


3.8 


1.6 


0.0 


Larsen Test 




5.7 


5.1 


2.6 ( 2.7; 9509) 


0.0 ( 0.0; 1003) 


Dix.-OBri. 




5.5 


5.4 


1.1 ( 1.1; 9967) 


0.0 ( 0.0; 1276) 


OBH85 




5.5 


5.4 ( 5.4; 9994) 


1.6 ( 6.3; 2467) 


0.0 ( NaN; 0) 


Runs UD 


4.4 


3.7 


3.7 


0.0 


0.0 


Larsen UD 


4.7 


6.1 


6.8 ( 6.8; 9999) 


0.1 ( 0.1; 9056) 


0.0 ( 0.0; 874) 


Dix.-OBri. UD 


5.7 


6.0 


7.8 


0.0 ( 0.0; 9840) 


0.0 ( 0.0; 1082) 


OBri85 UD 


5.8 


5.5 


3.8 ( 5.6; 6826) 


0.0 ( 0.0; 5) 


0.0 ( NaN; 0) 



Table 10. Crossing tree results for Feller's square root diffusion process with 
K = 6, a = 1, n = 0.2, n = 1250, 5 = 0.028163. Percentage of 10,000 sample 
paths rejected, with an average of 1250 crossings in time 5. At higher levels, 
insufficient data length means some datasets are not tested. 



UD) is again noteworthy at level 2 for rejecting about 16% of the paths, showing evidence of 
clustering in the excursions, consistent with reversion to the mean. In summary, these results 
suggest the crossing tree rejects about 77-81% of sample paths. This is almost 30% better than 
the quadratic variation method (Figure [23]) which rejects about 30% (KS) and 50% (CVM). 
Again, Park and Vasudev report lower rates: 14% (KS) and 26% (CVM). 

It is important to note here that for all these processes we actually did the tests three times, 
building the crossing tree relative to a value 5q, where 5q is one of 0, A(l) (the first data point), 
or the mean of the first 30 crossings built about a temporary crossing tree based at 0. It is 
results for the last choice we report since that has shown the highest power. For example, with 
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Figure 19. Crossing tree results for Feller's square root diffusion process with 
K = 6, a = 1, n = 0.2, n = 5000, 5 = 0.028474. Percentage of 10,000 sample 
paths rejected using distribution tests (left) and independence tests (right). Here, 
95% confidence intervals are 1% or smaller. 



K = 6,fi = 0.2, a = 1, n = 5000, S = 0.028474 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


5 


X 2 (+2 df) 




13.5 


50.0 


71.6 


0.0 ( 1.5; 135) 


0.0 ( NaN; 0) 


Twos Test 




10.2 


24.4 


19.7 


0.1 ( 0.2; 3569) 


0.0 ( 0.0; 98) 


G Test 




9.8 


36.4 


50.9 


0.3 ( 0.9; 3569) 


0.0 ( 1.0; 98) 


KS Discrete 




13.3 


36.6 


64.9 


1.4 


0.1 


KLP98 Test 




5.5 


12.5 


46.6 


0.1 


0.0 


Joint Dist. 




9.4 


25.1 


66.5 


0.1 ( 5.8; 258) 


0.0 ( NaN; 0) 


Autocorr. 




5.0 


6.4 


0.8 


0.1 ( 3.7; 377) 


0.0 ( 33.3; 3) 


Runs Test 




4.6 


4.7 


6.9 


0.1 


0.0 


Larsen Test 




4.8 


5.6 


4.1 


0.1 ( 0.3; 3015) 


0.0 ( 0.0; 88) 


Dix.-OBri. 




5.4 


6.1 


3.4 


0.1 ( 0.2; 3569) 


0.0 ( 0.0; 98) 


OBH85 




5.7 


6.5 


9.6 ( 10.7; 8982) 


0.1 ( 7.4; 162) 


0.0 ( NaN; 0) 


Runs UD 


4.6 


5.2 


10.1 


1.1 


0.0 


0.0 


Larsen UD 


5.0 


5.4 


7.6 


2.3 ( 2.3; 9940) 


0.0 ( 0.0; 3280) 


0.0 ( 0.0; 58) 


Dix.-OBri. UD 


5.8 


8.0 


15.1 


2.1 


0.0 ( 0.0; 3456) 


0.0 ( 0.0; 65) 


OBri85 UD 


5.7 


5.9 


7.0 ( 7.0; 9977) 


0.2 ( 5.8; 412) 


0.0 ( NaN; 0) 


0.0 ( NaN; 0) 



Table 11. Crossing tree results for Feller's square root diffusion process with 
K = 6, a = 1, n = 0.2, n = 5000, 5 = 0.028474. Percentage of 10,000 sample 
paths rejected, with an average of 5000 crossings in time 20. At higher levels, 
insufficient data length means some datasets are not tested. 



the Feller processes with k = 8, n = 5000 the other two choices were comparable, rejecting 
about 44% (x 2 )> 31% (KS discrete) and 44% (joint distribution) which is much less than 66%, 
77% and 81%, respectively. We attribute this substantial difference to the non-zero mean of 
the process. Again, these processes don't satisfy the null hypothesis, so making the choice 
that gives the highest power seems most appropriate. The effect on the type 1 error appears 
negligible between these three choices. For the other processes we simulated, this choice always 
had superior power to the other two, although the difference was not always as large. 

It is also worth noting again that where we see significant rejections by the joint distribution 
test, they are due to primarily to dependence, not distribution. For the longer (n = 5000) 
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Figure 20. Crossing tree results for Feller's square root diffusion process with 
K = 8, a = 1, n = 0.2, n = 1250, 5 = 0.028276. Percentage of 10,000 sample 
paths rejected using distribution tests (left) and independence tests (right). Here, 
95% confidence intervals are 1% or smaller. 



k = 8, fi = 0.2, a = 1, n = 1250, 5 = 0.028276 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


X 2 (+2 df) 




9.2 


17.6 


0.2 ( 2.1; 849) 


0.0 ( NaN; 0) 


Twos Test 




5.6 


7.3 


2.4 ( 2.4; 9935) 


0.0 ( 0.5; 429) 


G Test 




6.6 


10.6 


10.9 ( 11.0; 9935) 


0.0 ( 0.9; 429) 


KS Discrete 




6.9 


12.2 


15.0 


0.1 


KLP98 Test 




4.2 


4.8 


6.9 


0.0 


Joint Dist. 




6.7 


10.1 


5.5 ( 14.3; 3871) 


0.0 ( NaN; 0) 


Autocorr. 




4.6 


4.8 


1.1 ( 1.3; 8523) 


0.0 ( 0.0; 9) 


Runs Test 




4.3 


3.5 


1.6 


0.0 


Larsen Test 




5.9 


5.3 


1.6 ( 1.8; 9365) 


0.0 ( 0.0; 350) 


Dix.-OBri. 




5.5 


5.3 


0.6 ( 0.6; 9935) 


0.0 ( 0.0; 429) 


OBH85 




5.3 


5.5 ( 5.5; 9993) 


1.0 ( 6.4; 1600) 


0.0 ( NaN; 0) 


Runs UD 


4.5 


3.8 


3.8 


0.0 


0.0 


Larsen UD 


4.8 


4.9 


7.0 


0.1 ( 0.1; 8905) 


0.0 ( 0.0; 261) 


Dix.-OBri. UD 


5.6 


5.7 


8.3 


0.0 ( 0.0; 9807) 


0.0 ( 0.0; 363) 


OBri85 UD 


5.4 


5.5 


3.9 ( 5.7; 6927) 


0.0 ( NaN; 0) 


0.0 ( NaN; 0) 



Table 12. Crossing tree results for Feller's square root diffusion process with 
K = 8, a = 1, n = 0.2, n = 1250, 5 = 0.028276. Percentage of 10,000 sample 
paths rejected, with an average of 1250 crossings in time 5. At higher levels, 
insufficient data length means some datasets are not tested. 



datasets and k = 6, at level 3 about 32% of all paths are rejected. After permuting the data, 
about 14% of paths are rejected. So, about 18% of all paths are rejected due to bivariate 
dependence. With k = 8 the difference is even more dramatic. At level 3 about 43% of paths 
are rejected, which falls to about 16% after permuting the data. Thus, 27% of the paths are 
rejected due bivariate dependence. That's about two-thirds of all the paths rejected by that 
test. With short (n = 1250) datasets, for both parameter values the differences between using 
permuted and non-permuted data are small, with both tests rejecting around the level of the 
test. For example, with k = 6 the number of rejections was comparable and in the range 4-8% 
of all paths at levels 1-3. 
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Figure 21. Crossing tree results for Feller's square root diffusion process with 
K = 8, a = I, n = 0.2, n = 5000, 5 = 0.028330. Percentage of 10,000 sample 
paths rejected using distribution tests (left) and independence tests (right). Here, 
95% confidence intervals are 1% or smaller. 



k = 8, (j. = 0.2, a-=l,n = 5000, 5 = 0.028330 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


5 


X 2 (+2 df) 




14.3 


48.9 


66.2 ( 66.2; 9994) 


0.0 ( 0.0; 33) 


0.0 ( NaN; 0) 


Twos Test 




10.6 


26.8 


17.8 


0.0 ( 0.1; 1319) 


0.0 ( 0.0; 18) 


G Test 




9.6 


36.0 


53.1 


0.0 ( 0.3; 1319) 


0.0 ( 0.0; 18) 


KS Discrete 




13.2 


39.6 


76.7 


0.3 


0.0 


KLP98 Test 




5.8 


8.0 


52.0 


0.0 


0.0 


Joint Dist. 




9.9 


24.1 


81.2 


0.0 ( 2.8; 108) 


0.0 ( NaN; 0) 


Autocorr. 




4.8 


7.2 


0.3 


0.1 ( 5.0; 160) 


0.0 ( NaN; 0) 


Runs Test 




4.7 


4.1 


11.1 


0.0 


0.0 


Larsen Test 




5.2 


5.4 


3.8 


0.0 ( 0.3; 1193) 


0.0 ( 0.0; 18) 


Dix.-OBri. 




5.1 


6.3 


2.6 


0.0 ( 0.2; 1319) 


0.0 ( 0.0; 18) 


OBri85 




5.3 


6.0 


10.6 ( 13.0; 8162) 


0.0 ( 7.7; 52) 


0.0 ( NaN; 0) 


Runs UD 


4.3 


4.6 


11.5 


0.4 


0.0 


0.0 


Larsen UD 


5.0 


5.2 


7.8 


0.8 ( 0.8; 9766) 


0.0 ( 0.0; 1156) 


0.0 ( 0.0; 12) 


Dix.-OBri. UD 


5.7 


7.0 


15.7 


0.8 


0.0 ( 0.0; 1286) 


0.0 ( 0.0; 14) 


OBri85 UD 


5.8 


6.4 


7.0 ( 7.0; 9981) 


0.0 ( 2.8; 109) 


0.0 ( NaN; 0) 


0.0 ( NaN; 0) 



Table 13. Crossing tree results for Feller's square root diffusion process with 
K = 8, a = I, n = 0.2, n = 5000, 5 = 0.028330. Percentage of 10,000 sample 
paths rejected, with an average of 5000 crossings in time 20. At higher levels, in- 
sufficient data length means some datasets are not tested. Here, 95% confidence 
intervals are 1% or smaller. 
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Figure 22. Rejection rates using sample quadratic variation for Feller's square- 
root diffusion. Short (n = 1250) datasets. Left has drift coefficient k = 6, right 
has drift coefficient k = 8. Here, 95% confidence intervals are 1% or smaller. 



QVresults.Feller.n5000tau20.1e4.k6 QVresults.Feller.n5000tau20.1e4.k8 




50 100 150 200 250 300 350 400 50 100 150 200 250 300 350 400 



Figure 23. Rejection rates using sample quadratic variation for Feller's square- 
root diffusion. Long (n = 5000) datasets. Left has drift coefficient k = 6, right 
has drift coefficient k = 8. Here, 95% confidence intervals are 1% or smaller. 



4.4. Fractional Brownian Motion. We simulated Fractional Brownian Motion (FBM), 
mainly as an example of a process with stationary increments that is not strong Markov. 
We simulated 1250 crossings by simulating long paths of Fractional Brownian Motion using 
the Wood and Chan algorithm (jWood and Chanl . ll994h . The method was used to produce a 
sequence {0, X±, . . . , X m }, m large, where Var(X(n)) = a 2 n 2H with a 2 = 1/250. With the 
transformation Xi = j H X% we use the self-similarity of FBM to produce a sequence, not on 
the positive integers, but at times separated by 7 = 10 -5 in our case. The choice of a 2 = 1/250 
allows direct comparison with our results for Brownian motion too, in that the Hurst parameter 
is different, but the variance is the same. Simulating long paths in this way, with a fine time 
resolution, we then find level crossings with a time accuracy of 10 -5 . We then did a search 
for the crossing size 5 that gave an average of 1250 crossings by time 5. The crossings for this 
S were used as the data for the crossing tree method. 

Three values for the Hurst parameter (H) were chosen: 0.7, 0.9, and 0.3. Values of H 
in (1/2, 1) provide long-range dependent increments, while values in (0, 1/2) provide "anti- 
persistent" increments. The results for 0.7 and 0.9 are largely similar. Amongst the distribution 
tests (Figures 1241 (left) and 1251 (left)), all but KLP98 are effective in rejecting nearly all the paths 
across a range of levels. The range is larger for higher H, suggesting that fewer subcrossings 
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FBM H=0.7 Distr. Test Results 




FBM H=0.7 Indep. Test Results 
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Figure 24. Crossing tree results for Fractional Brownian Motion with H = 0.7, 
n = 1250, 5 = 0.0010176. Percentage of 10,000 sample paths rejected using 
distribution tests (left) and independence tests (right). Here, 95% confidence 
intervals are 1% or smaller. 



H = 0.7, a = 1, n = 1250, 5 = 0.00101760 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


5 


X 2 (+2 df) 




100.0 


100.0 


87.4 


60.0 ( 60.6; 9903) 


3.2 ( 99.7; 325) 


Twos Test 




100.0 


100.0 


95.7 


61.2 


26.2 ( 26.2; 9999) 


G Test 




100.0 


100.0 


97.2 


69.0 


38.2 ( 38.2; 9999) 


KS Discrete 




100.0 


100.0 


97.2 


60.6 


26.2 


KLP98 Test 




19.7 


11.2 


6.8 


3.3 


0.2 


Joint Dist. 




100.0 


99.9 


82.0 


26.6 ( 26.6; 9998) 


3.1 ( 10.6; 2953) 


Autocorr. 




100.0 


99.8 


92.4 


68.9 ( 68.9; 9997) 


29.6 ( 34.4; 8598) 


Runs Test 




7.8 


5.4 


3.5 


1.9 


0.1 


Larsen Test 




7.6 


7.4 


6.5 


5.6 


3.2 ( 3.2; 9915) 


Dix.-OBri. 




18.6 


12.2 


8.5 


6.5 


0.3 ( 0.3; 9999) 


OBri85 




11.1 


8.6 ( 8.6; 9995) 


6.2 ( 7.1; 8779) 


1.6 ( 4.5; 3607) 


0.0 ( 0.0; 49) 


Runs UD 


28.6 


11.6 


4.6 


1.2 


0.0 


0.0 


Larsen UD 


2.6 


2.7 


2.8 


2.0 ( 2.1; 9880) 


0.1 ( 0.1; 7913) 


0.0 ( 0.0; 4073) 


Dix.-OBri. UD 


0.7 


1.4 


1.8 


0.4 


0.0 ( 0.0; 9591) 


0.0 ( 0.0; 6577) 


OBri85 UD 


5.6 


5.5 


4.4 ( 5.4; 8166) 


0.2 ( 2.1; 999) 


0.0 ( 0.0; 1) 


0.0 ( NaN; 0) 



Table 14. Crossing tree results for Fractional Brownian Motion with H = 0.7, 
n = 1250, 5 = 0.0010176. Percentage of 10,000 sample paths rejected, with an 
average of 1250 crossings in time 5. At higher levels, insufficient data length 
means some datasets are not tested. 



numbers are needed to reject when the dependence is stronger. Amongst the independence 
tests (Figures [241 (right) and [25] (right)), there is a slight difference between the comparison of 
the joint distribution and autocorrelation tests. For H = 0.7 the power of the autocorrelation 
test is similar to that of the distribution tests, while power of the joint distribution test decays 
much faster with increasing level. With H = 0.9, the power of both remains high- H = 0.9 
is just more easily rejected at higher levels. In comparison, Figure [28] shows that with our 
implementation of the quadratic method, the power is essentially the same, in that both the 
KS and CVM tests have near 100% rejection in the best case, althought for H = 0.7 the power 
would depend more critically on the value of c used. For H = 0.9 rejection is clearer, with near 
100% rejection for all values of c. 
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FBM H=0.9 Distr. Test Results 



FBM H=0.9 Indep. Test Results 
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Figure 25. Crossing tree results for Fractional Brownian Motion with H = 0.9, 
n = 1250, 5 = 0.000312. Percentage of 10,000 sample paths rejected using 
distribution tests (left) and independence tests (right). 



H = 0.9, a = 1, n = 1250, 5 = 0.000312 




% of all (% of tested: # tested) 


levels 





1 


2 


3 


4 


5 


X 2 (+2 df) 




100.0 


100.0 


100.0 


99.9 


93.4 ( 95.0; 9825) 


Twos Test 




100.0 


100.0 


100.0 


99.9 


94.5 


G Test 




100.0 


100.0 


100.0 


100.0 


96.3 


KS Discrete 




100.0 


100.0 


100.0 


100.0 


93.5 


KLP98 Test 




64.4 


37.2 


21.0 


11.3 


3.7 


Joint Dist. 




100.0 


100.0 


100.0 


99.1 


76.7 ( 76.8; 9993) 


Autocorr. 




99.9 (100.0; 9989) 


99.3 (100.0; 9929) 


97.1 (100.0; 9714) 


91.0 ( 99.8; 9116) 


73.4 ( 95.0; 7725) 


Runs Test 




27.3 


13.1 


6.6 


2.6 


1.3 


Larsen Test 




24.6 


18.0 


12.5 


7.1 


3.6 


Dix.-OBri. 




69.7 


40.1 


21.4 


11.8 


7.1 


OBri85 




41.1 ( 46.7; 8810) 


15.2 ( 24.1; 6305) 


4.6 ( 14.4; 3182) 


0.8 ( 7.3; 1064) 


0.1 ( 4.0; 224) 


Runs UD 


17.3 


8.3 


3.0 


0.5 


0.0 


0.0 


Larsen UD 


2.2 


1.9 ( 1.9; 9848) 


1.5 ( 1.6; 9458) 


0.7 ( 0.9; 8396) 


0.1 ( 0.1; 6283) 


0.0 ( 0.0; 3885) 


Dx.OBr UD 


0.8 


0.9 ( 0.9; 9935) 


0.5 ( 0.5; 9719) 


0.1 ( 0.1; 9165) 


0.0 ( 0.0; 7904) 


0.0 ( 0.0; 5707) 


OBri85 UD 


5.7 


3.7 ( 5.2; 7144) 


1.2 ( 4.5; 2659) 


0.0 ( 0.5; 211) 


0.0 ( 0.0; 1) 


0.0 ( NaN; 0) 



Table 15. Crossing tree results for Fractional Brownian Motion with H = 0.9, 
n = 1250, 5 = 0.000312. Percentage of 10,000 sample paths rejected, with an 
average of 1250 crossings in time 5. At higher levels, insufficient data length 



means some datasets are not tested. 



The anti-persistent case H = 0.3 shows a striking difference from the long-range dependent 
case. For all the tests, the power falls off dramatically with higher power. Amongst the dis- 
tribution tests, the KS discrete test decays slowest, while amongst the independence tests, the 
joint distribution test decays slowest. Based on level 0, 100% of the paths would be rejected 
which is higher than from our implementation of the quadratic variation method: 46% (KS) and 
65% (CVM). 

5. Analysis of foreign exchange rates 

In this section we show the results of applying the crossing tree-based test to foreign exchange 
rate tick data obtained from the Securities Industry Research Centre of Asia-Pacific (SIRCA). 
We have five series each representing a different exchange rate, for the period January to De- 
cember 2003: 

(1) Australian Dollar to US Dollar (AUD-USD), 

(2) Euro to UK Pound(EUR-GBP), 
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Figure 26. Crossing tree results for Fractional Brownian Motion with H = 0.3, 
n = 1250, 5 = 0.0174180. Percentage of 10,000 sample paths rejected using 
distribution tests (left) and independence tests (right). Here, 95% confidence 
intervals are 1% or smaller. 



H = 0.3, a = 1, n = 1250, S = 0.0174180 




% of all (% of tested; # tested) 


levels 





1 


2 


3 


4 


X 2 (+2 df) 




100.0 


50.0 ( 61.9; 8084) 


0.0 (100.0; 1) 


0.0 (100.0; 1) 


Twos Test 




99.9 


26.4 


0.0 ( 0.0; 7708) 


0.0 ( 0.2; 460) 


G Test 




100.0 


60.9 


0.6 ( 0.8; 7708) 


0.0 ( 0.2; 460) 


KS Discrete 




100.0 


80.5 


11.3 


0.0 


KLP98 Test 




38.6 


7.1 


0.0 


0.0 


Joint Dist. 




100.0 


46.4 ( 47.7; 9731) 


0.0 (100.0; 1) 


0.0 (100.0; 1) 


Autocorr. 




71.4 


3.3 ( 3.3; 9997) 


0.1 ( 4.6; 216) 


0.0 (100.0; 1) 


Runs Test 




8.8 


2.0 


0.0 


0.0 


Larsen Test 




4.5 


3.0 ( 3.0; 9899) 


0.0 ( 0.1; 3957) 


0.0 ( 0.0; 417) 


Dix.-OBri. 




3.1 


2.9 


0.0 ( 0.0; 7708) 


0.0 ( 0.0; 460) 


OBH85 




7.7 ( 7.7; 9997) 


1.9 ( 5.3; 3571) 


0.0 ( NaN; 0) 


0.0 ( NaN; 0) 


Runs UD 


100.0 


59.4 


2.4 


0.0 


0.0 


Larsen UD 


13.8 


16.9 


6.3 ( 6.8; 9310) 


0.0 ( 0.0; 2361) 


0.0 ( 0.0; 6) 


Dix.-OBri. UD 


99.0 


52.5 


2.8 ( 2.8; 9995) 


0.0 ( 0.0; 4594) 


0.0 ( 0.0; 7) 


OBri85 UD 


8.3 


6.2 


0.0 ( 0.4; 457) 


0.0 ( NaN; 0) 


0.0 ( NaN; 0) 



Table 16. Crossing tree results for Fractional Brownian Motion with H = 0.3, 
n = 1250, 5 = 0.0174180. Percentage of 10,000 sample paths rejected, with an 
average of 1250 crossings in time 5. At higher levels, insufficient data length 
means some datasets are not tested. 



(3) Euro to US Dollar (EUR-USD), 

(4) UK Pound to US Dollar (GBP-USD), 

(5) Japanese Yen to US Dollar (JPY-USD). 

As is common for considering finance data, we work with log-transformed data. Table [T7] lists 
some of the statistical properties of the increments of the log-transformed data. The sample 
mean is small in relation to the standard deviation, suggesting a negligible effect from drift can 
be expected. 
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5.1. AUD-USD. Table [181 shows the results from applying the crossing tree test to the AUD- 
USD data at levels to 7. Because these are results for one dataset, the numbers in the body 
of this table are quite different from those in the tables for the simulation data. Here we show 
p- values from applying tests at particular levels. Where our tests simply reject or not, without 
p-values, we show "< 0.05" to denote rejecting the null hypothesis and "> 0.05" where the null 
hypothesis is not rejected. Where a test rejects at 5% significance we also show the result in 
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Figure 27. Quadratic variation results for Fractional Brownian Motion with 
length 1250 and H = 0.7 (left) and H = 0.9 (right). Here, 95% confidence 
intervals are 1% or smaller. 
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Figure 28. Quadratic variation results for Fractional Brownian Motion with 
length 1250 and H = 0.3. Here, 95% confidence intervals are 1% or smaller. 



Series 


Length 


Sample 


Standard 


Sample 


Sample 






Mean 


Deviation 


Skewness 


Kurtosis 


AUD-USD 


450,641 


6.45xl0~ 7 


2.47xl0" 4 


0.2242 


33.3344 


EUR-GBP 


166,024 


4.81 xl0~ 7 


2.69xl0~ 4 


-1.0331 


63.2093 


EUR-USD 


6,086,353 


3.00 xl0~ 8 


1.15xl0~ 4 


-0.4466 


52.9049 


GBP-USD 


3,608,072 


2.86xl0" 8 


9.48xl0" 5 


-0.1496 


22.4587 


JPY-USD 


4,184,020 


-2.41xl0- 8 


1.05xl0" 4 


-0.2611 


56.8353 



Table 17. Statistical properties of the increments of the log-transformed 2003 
financial exchange rate tick data. 
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boldface. The number of subcrossing numbers available to the tests at each level is shown in 
the second row. 

A clear observation is that the tests of distribution and independence universally reject the 
null hypothesis at levels 1 and 2. Further investigation reveals the reason is the assumption of 
continuity in the null hypothesis. When the data are not exactly crossings already, crossings are 
found by linear interpolation between known data points. A large number of crossings between 
any two adjacent data points suggests 5 is small in relation to the vertical distance between the 
points. In these cases there won't be excursions (i.e., up-down or down-up pairs.) When a high 
percentage of the crossings arise in this way the tests will reject, essentially as a consequence 
of small S, an indication that the assumption of continuity of paths cannot be supported by 
the data. Recall, for example, that the subcrossings of level 1 are the crossings of level 0. As 
shown in Table [T9l at level 0, 34% of the crossings result from two or more crossings between 
data points, and 7.8% of the crossings result from 4 or more crossings between data points. 



AUD-USD 


levels (I) 





1 


2 


3 


4 


5 


6 


7 


# SubX 




99157 


26341 


6734 


1702 


415 


100 


25 


mean xing length 
level (/ - 1) 




1.2 min 


3.9 min 


14.6 min 


57 min 


3.8 h 


15.4 h 


63 h 


X 2 (+2 df) 




0.000 


0.000 


0.107 


0.808 


0.441 


0.628 


> 0.05 


Twos Test 




0.000 


0.000 


0.007 


0.645 


0.492 


1.000 


0.690 


G Test 




0.000 


0.000 


0.067 


0.734 


0.341 


0.648 


0.468 


KS 




< 0.05 


< 0.05 


< 0.05 


> 0.05 


> 0.05 


> 0.05 


> 0.05 


KLP98 




0.000 


0.000 


0.084 


0.468 


0.091 


0.603 


0.374 


Joint Dist. 




0.000 


0.000 


0.322 


0.865 


0.239 


0.576 


0.573 


Autocorr 




< 0.05 


< 0.05 


> 0.05 


> 0.05 


> 0.05 


< 0.05 


> 0.05 


Runs 




0.000 


0.000 


0.867 


0.676 


0.901 


0.055 


0.447 


Larsen 




0.000 


0.003 


0.223 


0.267 


0.463 


0.807 


> 0.05 


DixOB 




0.000 


0.000 


0.027 


0.712 


0.647 


0.941 


0.321 


OBH85 




0.000 


0.000 


0.004 


0.378 


0.545 


0.279 


0.034 


# UD pairs 


60946 


23237 


6421 


1652 


427 


102 


22 


4 


Runs UD 


0.000 


0.782 


0.080 


0.104 


0.893 


0.920 


1.000 


1.000 


Larsen UD 


0.969 


0.002 


0.394 


0.863 


0.519 


0.678 


> 0.05 


> 0.05 


DixOB UD 


1.000 


0.003 


0.109 


0.983 


0.422 


0.626 


0.510 


> 0.05 


OBH85 UD 


0.000 


0.000 


0.556 


0.724 


0.233 


0.785 


0.635 


NA 



Table 18. Crossing tree test results for 2003 AUD-USD exchange rate tick data. 
Distribution and independence tests clearly reject the null hypothesis at levels 
1 and 2, most likely because the data does not support the continuous path 
assumption at those scales. 



At higher levels, apart from the a couple of anomalous results, the tests do not reject the null 
hypothesis at levels 4 and above. Level 3 is clearly the transition scale. A decision to reject 
level 3 depends on how one treats the tests. Our x 2 > joint distribution autocorrelation and G 
tests do not reject. Our simulation results show that our x 2 an d joint distribution tests are the 
most powerful in the absence of drift. With drift, the G test and autocorrelation tests were 
the most powerful. Moreover, only with Feller's square root diffusion process was the power 
of the discrete KS test noteworthy, and even then it was second (or third) most powerful after 
our x 2 an d joint distribution test. Perhaps the key here is that the Twos test rejects at level 3 
suggesting the number of twos in the subcrossings is unusually high or low. A high percentage 
of twos is consistent with an effect of multiple crossings from interpolation, suggesting that 
rejecting level 3 is appropriate on account of small 5. 
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levels 





1 


2 


3 


4 


5 


6 




> 2 


> 4 


> 2 


> 4 


> 2 


> 4 


> 2 


> 4 


> 2 


> 4 


> 2 


> 4 


> 2 


> 4 


AUD-USD 


34.2 


7.8 


8.8 


0.8 


1.1 


0.4 


0.6 


0.3 


0.5 


0.3 














EUR-GBP 


48.6 


14.8 


15.3 


2.4 


3 


1 


1.5 


0.7 


1.2 


0.9 


0.7 











EUR-USD 


44 


11 


13.6 


0.7 


1.2 


0.2 


0.6 


0.4 


0.8 


0.5 


1 


0.3 


0.6 





GBP-USD 


50.7 


18.5 


22 


2.3 


3.2 


0.3 


0.8 


0.4 


0.7 


0.3 


0.5 











JPY-USD 


44.6 


13 


16.6 


2 


3 


0.3 


0.7 


0.4 


0.8 


0.5 


0.6 


0.3 









Table 19. Percentage of crossings arising as either > 2 crossings (left column 
for each level) or > 4 crossings (right colunn for each level) between pairs of data 
points. High percentages suggest the data doesn't support the continuity as- 
sumption. 



Row 3 of Table [TBI shows how the levels correspond to mean crossing times. For example, the 
subcrossings of level 1 are determined using the crossings of level 0. Since there are are 320,810 
level crossings, and approximately 6,396 hours for trading currency in a year, the mean length 
of a level crossing is about 1.2 minutes. So we say the results of level 1 correspond to an 
average crossing length of 1.2 minutes. Since our tests do not reject above level 3, we would say 
the crossing tree technique does not reject the null hypothesis at a time scale between 15 and 
57 minutes and above. 

One might ask if the consistent rejection of the tests at lower levels is simply because of the size 
of the datasets tested. For example, at level 1, 99,157 numbers are tested in the distribution and 
independence tests. As a simple check for this, we split the data into non-overlapping blocks 
of length 1024 and applied a subset of our tests to the separate blocks. Partial blocks were 
discarded. So, for the AUD-USD data this yields 96 blocks of length 1024 at level 1. If, for 
a test, the results from Table [TBJ are obtained consistently from the blocks it would be strong 
evidence that the results for lower levels in that table are not an artifact of data length. For 
our x 2 test, 95 of 96 blocks are rejected. The numbers for some other tests are similar: Twos 
test (96), G test (95), KS discrete (96) and joint distribution test (94). Of the tests we tried, 
the autocorrelation test rejected the fewest of the blocks, rejecting 81 out of 96. These results 
are consistent across all five of our exchange rate datasets, including the outlier behaviour of 
the autocorrelation test rejecting from 31% (EUR-USD) to 100% (EUR-GBP) of the blocks. 
Beyond being a curiosity, the behaviour of the autocorrelation test here doesn't affect our results 
since the other tests consistently reject almost all blocks. As such, we won't comment more on 
these block results. 

5.2. EUR-GBP. Table [2D] shows results for the crossing tests applied to the EUR-GBP ex- 
change rate data. This dataset is about one third of the length of the AUD-USD dataset, but 
the number of subcrossings at level one is about half. This difference is easily explained by the 
higher percentage of multiple crossings arising from the interpolation between data points in the 
EUR-GBP data, as shown in Table [T9l It appears levels 3 and below reject because continuity 
isn't supported. 

Level 4 is a transition level. Rejection by our x 2 test is noteworthy, and one might reject the 
null at level 4 on the basis of this test. However, the result from the Twos test doesn't suggest 
the number of twos in the subcrossings is unusual, suggesting the rejection by the x 2 test is due 
to more than just small 5. This is further supported by the observation that the percentage of 
multiple crossings from interpolation is not very different between levels 4 and 5, yet several 
tests reject at level 4 and none reject at level 5. Again it seems there are additional features 
in the data causing the tests to reject at level 4. Finally, no tests reject at level 5, making the 
results at level 6 more meaningful- 5 is clearly not too small. From the results of levels 4 and 
6, one cannot clearly say continuous time-changed Brownian motion is not rejected above some 
fixed level. Thus, the use of a continuous time-changed Brownian motion above some threshold 
level seems unsupported for this dataset. 
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EUR- GBP 


levels (i) 





1 


2 


3 


4 


5 


6 


7 


# SubX 




52775 


15728 


4245 


1030 


253 


80 


20 


mean xing length 

level (1 — ~\\ 
it; vci it 1 1 




2.6 min 


7.3 min 


24.4 min 


1.5 h 


6.2 h 


25 h 


79 h 


X 2 (+2 df) 




0.000 


0.000 


0.000 


0.012 


0.930 


0.181 


> 0.05 


Twos Test 




0.000 


0.000 


0.000 


0.334 


0.530 


0.018 


0.824 


G Test 




0.000 


0.000 


0.000 


0.026 


0.891 


0.034 


0.598 


KS 




< 0.05 


< 0.05 


< 0.05 


> 0.05 


> 0.05 


< 0.05 


> 0.05 


KLP98 




0.000 


0.000 


0.000 


0.031 


0.542 


0.675 


0.359 


Joint Dist. 




0.000 


0.000 


0.000 


0.172 


0.120 


0.251 


0.093 


Autocorr 




< 0.05 


< 0.05 


> 0.05 


> 0.05 


> 0.05 


> 0.05 


> 0.05 


Runs 




0.000 


0.013 


0.382 


0.212 


0.924 


0.909 


0.110 


Larsen 




0.000 


0.000 


0.003 


0.003 


0.201 


< 0.05 


> 0.05 


DixOB 




0.000 


0.000 


0.605 


0.190 


0.700 


0.373 


0.187 


OBri85 




0.000 


0.000 


0.091 


0.024 


0.573 


0.521 


0.845 


# UD pairs 


22175 


10533 


3708 


1092 


242 


46 


20 


8 


Runs UD 


0.000 


0.000 


0.001 


0.665 


0.655 


0.891 


0.484 


0.057 


Larsen UD 


0.674 


0.999 


0.173 


0.392 


0.256 


> 0.05 


< 0.05 


> 0.05 


DixOB UD 


1.000 


1.000 


0.985 


0.254 


0.641 


0.135 


> 0.05 


> 0.05 


OBri85 UD 


0.001 


0.496 


0.036 


0.230 


0.814 


0.203 


0.201 


NA 



Table 20. Crossing tree test results for 2003 EUR-GBP exchange rate tick data. 
Distribution and independence tests clearly reject the null hypothesis at levels 1 
to 3: the data does not support the continuous path assumption at those scales. 
Results at levels 4 and 6 seem to reject a continuous chronometer. 



5.3. EUR-USD. Table [21] shows results from applying the crossing tree tests to the EUR- 
USD exchange rate data. This dataset is the largest of our five. Levels 1 to 4 show the same 
consistent rejection explained above for small 5. Level 5 appears to be the transition scale. The 
null hypothesis cannot be rejected for levels 6 and above, corresponding to a timescale of longer 
than 48 minutes. 

5.4. GBP-USD. Table [22] shows results from applying the crossing tree tests to the GBP-USD 
exchange rate data. Levels 1 to 3 show the same consistent rejection explained above for small 
5. Many tests reject at level 4, but notably the Twos test does not. This suggests the number 
of twos is not significant, and multiple crossings from interpolations between datapoints is not 
a dominant feature. Thus, rejection by the tests appears indicative of some other features 
in the data such that the null hypothesis is not satisfied at that scale. At higher levels, the 
autocorrelation test result at level 6 appears anomalous. Ignoring it, the null hypothesis cannot 
be rejected at levels 5 and above, corresponding to timescales larger than a cutoff between 11 
and 46 minutes. 

5.5. JPY-USD. Table [23] shows results from applying the crossing tree tests to the JPY-USD 
exchange rate data. Levels 1 to 4 show the same consistent rejection explained above for small 
5. The null hypothesis cannot be rejected for levels 5 and above, so timescales larger than a 
cutoff between 12 and 52 minutes. At level 7 the Runs test shows a significant result that we 
dismiss, since it is an isolated result and the p-value is very close to 0.05 anyways. 

6. Conclusion 

Using a recently proposed characterisation of Brownian motion using the crossing tree, we 
have proposed a new test for continuous local martingales, equivalently a Brownian motion 
time-changed by a continuous chronometer. It is particularly well-suited to tick-by-tick data, 
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EUR-USD 


levels (I) 





1 


2 


3 


4 


5 


6 


7 


8 


# SubX 




1092577 


198585 


34804 


7981 


1916 


492 


120 


32 


mean xing length 
level Q - 1) 




0.1 mill 


0.4 min 


1.9 min 


1 1 min 


48 min 


3.3 h 


13 h 


53 h 






0.000 


0.000 


0.000 


0.000 


0.020 


0.669 


0.915 


> 0.05 


Twns Test 




0.000 


0.000 


0.000 


0.013 


0.002 


0.822 


0.927 


1.000 


G Test 




0.000 


0.000 


0.000 


0.000 


0.008 


0.513 


0.830 


0.316 


KS 




< 0.05 


< 0.05 


< 0.05 


< 0.05 


< 0.05 


> 0.05 


> 0.05 


> 0.05 


KLP98 




0.000 


0.000 


0.000 


0.000 


0.015 


0.306 


0.725 


0.890 


Tnint T~)i«i't" 

» J Ullll J / lijtt 




0.000 


0.000 


0.000 


0.000 


0.119 


0.891 


0.765 


0.934 






< 0.05 


< 0.05 


< 0.05 


< 0.05 


> 0.05 


> 0.05 


> 0.05 


> 0.05 


R.uiis 




0.000 


0.000 


0.000 


0.060 


0.511 


0.622 


0.645 


0.862 


Laxsen 




0.000 


0.000 


0.434 


0.089 


0.377 


0.602 


0.284 


> 0.05 


DixOB 




0.000 


0.000 


0.000 


0.155 


0.899 


0.340 


0.761 


0.752 


OBri85 




0.000 


0.000 


0.000 


0.205 


0.251 


0.353 


0.314 


0.820 


# UD pairs 


944811 


347117 


64832 


9308 


2038 


480 


126 


28 


9 


Runs UD 


0.000 


0.000 


0.000 


0.000 


0.735 


0.699 


0.054 


0.879 


1.000 


Larsen UD 


0.000 


0.052 


0.046 


0.376 


0.437 


0.250 


0.266 


< 0.05 


> 0.05 


DixOB UD 


0.009 


0.000 


0.000 


0.000 


0.748 


0.270 


0.863 


0.088 


> 0.05 


OBH85 UD 


0.000 


0.000 


0.000 


0.001 


0.619 


0.144 


0.578 


0.023 


NA 



Table 21. Crossing tree test results for 2003 EUR-USD exchange rate tick 
data. Distribution and independence tests clearly reject the null hypothesis at 
levels 1 to 5, most likely because the data does not support the continuous path 
assumption at those scales. The null hypothesis cannot be rejected at levels 6 
and above. 



which is not at regularly spaced time intervals. An advantage is the ability to test for continuous 
local martingales at a range of timescales, including an ability to test whether the data supports 
(in a statistical sense) the hypothesis of continuous paths. 

An alternative test uses the sample quadratic variation (realised volatility), but suffers from 
issues (e.g., market microstructure noise) that give the sample quadratic variation poor sta- 
tistical properties, especially at high frequencies. Further, formal statistical tests require the 
choice of an arbitrary parameter. As we show, this choice could have a large impact on the test 
results. Use of the crossing tree characterisation avoids these problems. 

From simulation results for a range of diffusions, use of the crossing tree generally shows equal 
or higher discriminatory power than the quadratic variation approach for a similar number of 
observations. Although, it notably lacks power to detect drift. 

From testing high frequency foreign exchange rate data for five rates, we show at small 
timescales (typically below about 15 minutes) the data doesn't support the hypothesis of a 
continuous local martingale. At larger timescales, with the exception of the EURGBP exchange 
rate, the continuous local martingale assumption cannot be rejected. In the case of EURGBP, 
some larger timescale s are rejected while others are not. This extends the earlier conclusions of 
I Andersen et al. ( 2000l ) who normalised daily foreign exchange rate returns by realised volatilities 



(formed by summing 30 minute return volatilities) and observed standard Normal values. 

Appendix A. Simulation techniques 

We consider here the problem of simulating a stochastic process at the times TP, k = 0, 1, 

That is, simulating the crossing points X(TP,) and crossing lengths W®. 

A.l. Ge neral Results for D iffusion processes. For diffusion processes we can use the scale 
function ( Revuz and Yor . 19991 . p. 278-290) to simulate the sequence {X(T®)}k>o- Let W de- 



note Brownian motion. Suppose X is a process described by dX{t) = A(X{t))dt+B(X(t))dW{t) 
satisfying the following conditions. 
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GBP-USD 


levels (/) 





1 


2 


3 


4 


5 


6 


7 


8 


# SubX 




735365 


167751 


34371 


8354 


2053 


504 


130 


31 


mean xing length 
level (I - 1) 




0.2 min 


0.5 min 


2.3 min 


11.2 min 


46 min 


3.1 h 


12.7 h 


49 h 


v 2 (-\-2 dfl 




0.000 


0.000 


0.000 


0.000 


0.848 


0.140 


0.712 


> 0.05 


Twos Test 




0.000 


0.000 


0.000 


0.784 


0.377 


0.624 


0.539 


1.000 


G Test 




0.000 


0.000 


0.000 


0.009 


0.859 


0.112 


0.386 


0.727 


KS 




< 0.05 


< 0.05 


< 0.05 


> 0.05 


> 0.05 


> 0.05 


> 0.05 


> 0.05 


KLP98 




0.000 


0.000 


0.000 


0.000 


0.460 


0.678 


0.238 


0.675 


i > Ullll' J J ± iJ Lf • 




0.000 


0.000 


0.000 


0.265 


0.670 


0.184 


0.967 


0.522 


A ntfirnrr 




< 0.05 


< 0.05 


< 0.05 


< 0.05 


> 0.05 


< 0.05 


> 0.05 


> 0.05 


R,uns 




0.000 


0.000 


0.000 


0.140 


1.000 


0.053 


0.690 


1.000 


Larsen 




0.000 


0.000 


0.049 


0.000 


0.834 


0.856 


0.185 


> 0.05 


DixOB 




0.000 


0.000 


0.000 


0.007 


0.190 


0.656 


0.289 


0.584 


OBH85 




0.000 


0.000 


0.000 


0.024 


0.388 


0.112 


0.817 


0.495 


# UD pairs 


486032 


199187 


49413 


8831 


2166 


522 


122 


34 


4 


Runs UD 


0.000 


0.009 


0.000 


0.000 


0.452 


0.149 


0.001 


0.093 


1.000 


Larsen UD 


0.000 


0.000 


0.005 


0.297 


0.459 


0.721 


0.063 


< 0.05 


> 0.05 


DixOB UD 


1.000 


0.000 


0.000 


0.000 


0.861 


0.344 


0.989 


0.182 


> 0.05 


OBH85 UD 


0.000 


0.000 


0.000 


0.004 


0.820 


0.767 


0.126 


0.082 


NA 



Table 22. Crossing tree test results for 2003 GBP-USD exchange rate tick 
data. Distribution and independence tests clearly reject the null hypothesis at 
levels 1 to 3, most likely because the data does not support the continuous path 
assumption at those scales. On the other hand, the null hypothesis cannot be 
rejected at levels 5 and above. 



(1) The paths of X are continuous. 

(2) The state space E has form [l,r], (l,r], [l,r) or (l,r) where — oo <l<r< oo. 

(3) The process is regular. That is, for any y in the interior of E, I < x, and y < r, 
P[T(y) < oo|X(0) = x) > where T(y) is the hitting time for y. 

(4) Both A and B are locally bounded Borel functions and B(-) > 0. 

Fix xq in the interior of the range of X, then let the scale function s be a solution of 

^-s(x) = exp (-2 f X (A(u)/B 2 (u))du) . 
Take x £ <5Z. Then, provided [x — 6, x + S] is in the interior of the range of X, we have 

(7) Ps (x) = F(X(T° k+1 ) =x + 5 | X(T° k ) = x) = ~ s{ ? ~ 5) . 

s(x + o) — s{x — 0) 

Simulating the cros sing lengths W® is harder, though can be done in some cases. See 
Burq and Joned ( 20081 ) for the case of Brownian motion. However, using the speed measure 



we can get an expression for the expected crossing length. Put 

m{dx) = 2B~ 2 (x) exp |2 J" ' (A(u) / 'B 2 (u))dvj dx 

then, provided [x — 5, x + 5] is in the interior of the range of X, we have (jRevuz and Yorl . 11999 . 
p. 304), 

E « +1 |*(T»)=x) = 00^f\ s(l + S)-s(y) )mm 



s(x + 5) — s(x) 
s(x + 5) — s(x — 5) 
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(s(y) - s(x - 5))m(dy). 



JPY-USD 


levels (I) 





1 


2 


3 


4 


5 


6 


7 j 


8 


# SubX 




829941 


171042 


33165 


7404 


1810 


435 


107 


24 


mean xing length 
level (I - 1) 




0.1 min 


0.5 min 


2.2 min 


12 min 


52 min 


3.5 h 


14.7 h 


59 h 


v 2 (A-1 dfl 

A / 




0.000 


0.000 


0.000 


0.000 


0.122 


0.689 


0.951 


> 0.05 


Twns Test 




0.000 


0.000 


0.000 


0.000 


0.359 


0.701 


0.699 


0.839 


G Test 




0.000 


0.000 


0.000 


0.000 


0.188 


0.672 


0.857 


0.885 


KS 




< 0.05 


< 0.05 


< 0.05 


< 0.05 


> 0.05 


> 0.05 


> 0.05 


> 0.05 


KLP98 




0.000 


0.000 


0.000 


0.000 


0.115 


0.825 


0.962 


0.770 


Tnint T~)i«i't" 




0.000 


0.000 


0.000 


0.000 


0.402 


0.499 


0.077 


0.610 


A litnrnrT 1 




< 0.05 


< 0.05 


< 0.05 


< 0.05 


> 0.05 


> 0.05 


> 0.05 


> 0.05 


R.uns 




0.000 


0.000 


0.000 


0.329 


0.640 


0.182 


0.048 


0.502 


Larsen 




0.000 


0.437 


0.229 


0.379 


0.826 


0.117 


0.675 


> 0.05 


DixOB 




0.000 


0.000 


0.000 


0.062 


0.570 


0.121 


0.885 


0.742 


OBH85 




0.000 


0.000 


0.000 


0.000 


0.621 


0.694 


0.299 


0.653 


# UD pairs 


651261 


244039 


52356 


9173 


1971 


452 


104 


29 


7 


Runs UD 


0.000 


0.000 


0.000 


0.000 


0.065 


0.173 


0.186 


0.449 


0.914 


Larsen UD 


0.000 


0.052 


0.216 


0.059 


0.592 


0.328 


0.038 


> 0.05 


> 0.05 


DixOB UD 


0.000 


0.000 


0.000 


0.000 


0.106 


0.692 


0.872 


0.664 


> 0.05 


OBH85 UD 


0.000 


0.000 


0.000 


0.000 


0.523 


0.998 


0.617 


0.032 


NA 



Table 23. Crossing tree test results for 2003 JPY-USD exchange rate tick data. 
Distribution and independence tests clearly reject the null hypothesis at levels 
1 to 4, most likely because the data does not support the continuous path as- 
sumption at those scales. The null hypothesis cannot be rejected at levels 5 and 
above. 



A. 2. Brownian motion with drift. Suppose that dX{t) = adt + dW{t) for a / 0. Put 
xq = then we get s(x) = — (2a)~ 1 e~ 2ax and m{dx) = 2e 2ax dx, whence 

F(X(T° +1 )=x + 5\X(T°) 

(8) E(W° +1 \X(T°) 

A. 3. Ornstein-Uhlenbeck process. Suppose that dX{t) = —aX(t)dt + adW(t) with a, a > 
0. Put xq = then we get s'(x) = e ax l a and m(dx) = (2/o 2 )e~ ax l a dx, whence 

F(X(T° k+1 ) = X + 8 1 X(T°) = x) = J f 7+ S s eauV ^ du = P(*) say 

and 

E(W£ +1 | X(T fc °) = x) = (2/a 2 )p(x) e ^ 2 -y 2 )l° 2 dz dy 

J x J y 

+ (2/a 2 )(l-p{x)) f X f V e a(z2 - y2)/a2 dzdy 
J x—8 J x—5 

= w{x) say. 

With X(0) ~ N(0,a 2 /(2a)), {X(t), t > 0} is stationary. Thus, for Y k = X(T fe °), {Y k , k = 
0, 1, . . .} is a positive recurrent random walk on <5Z, defined by ¥(Y(t + 1) = y + 5 \ Y(t) = y) = 
p{y) and ¥(Y(t + 1) = y — S | Y(t) = y) = 1 — p{y). Let 7r be the stationary distribution of 
Y. Then, to simulate the crossings of the Ornstein-Uhlenbeck process, we take Yq as a value 
from the stationary distribution ir and then simulate according to the Markov chain described 
by p(x). 
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A.4. Feller's Square Root Diffusion. Suppose that dX(t) = k(jj, - X(t))dt + <Ty/X(t)dW(t) 
for k, /i, (7 > 0, 2k/x/cj 2 > 1 and X(0) has the stationary Gamma(o, b) distribution with shape 
parameter a = 2K^/a 2 and scale parameter b = a 2 /(2k). For this range of parameter values 
the process is stationary with sample space (0, oo). 

Put xo = 5 then we get s'(x) = (x/S)~ 2K ^ a X s-5 ) and thus for x = nd, x > 25, we 

have 

P(X(T° +1 ) = x + 5 | Xtf) = x)= J - 5 s u _ 2K ^ e{2K/(72)udu = P<*) Bay. 

For x = S we have, since the process never visits 0, F(X(T k+1 ) = 26 \ X(T k ) = 5) = 1. 

In practice, to simulate {X(T$), k = 0, 1, . . .} we first obtain the first hitting time Tq and 
crossing point Yq = X(Tq). We do this by si mulating the discrete approx imate path {X k , k = 
0, 1, . . .}, using the order 1.0 Milstein scheme ( Kloeden and Platen! . 1992, p. 335), at regularly 



spaced points with interval A using 

°\Zl - A) 



(9) X k+l =X k + k{h - X k )A + a^X k Z k + ( 

Here Xq is a random Gamma(a, b) value from the stationary distribution of {X(t)}, and 
{Z\, Z2, ■ ■ ■} is an i.i.d. sequence of N(0, A) values. With Yq and p(x), the needed cross- 
ings can be simulated directly using the Markov chain. Although X(Tq) is the first hit of the 
lattice, we do not use it as a crossing. 

A. 5. Other Processes. For other processes, where a method to simulate the crossings directly 
may be unknown, it is still possible to simulate X(T£) and T®, k = 0, 1, . . . if sufficiently long 
sample paths can be simulated. In effect, one simulates a sample path at regularly spaced times, 
sufficiently close to minimize interpolation errors, and then determines where the crossings occur 
directly. The Matlab program 'gethits.m' does this, citejonesxx 

Appendix B. Determination of 5 

In this section we discuss how to determine the scale size 5 to simulate a process with, 
on average, a specified number iV of level crossings in a specified time interval, typically 
(Tq, Tq + to), where to is a specified constant. In many cases Tq = and so the interval is just 
(0,to). We make this assumption here unless otherwise stated. 

B. l. Brownian motion. A consequence of the strong Markov, self-s imilarity and sta t ionar y 
increment properties of Brownian motion is that = 5 2 for all k ( Burq and Jones . 20081 ). 



On average we want intervals of length to /N and so 5 = y/to/N. 

B.2. Brownian motion with drift. As shown in ([8]), the crossing lengths are constant. That 
is, they do not depend on where the process is. It is sufficient to solve 

5(e 2a6 - 1) _ to 
a(e 2aS + 1) ~ N 

for 5. 

B.3. Ornstein-Uhlenbeck process. For a given 5 > 0, let tt be the equilibrium distribution 
of the Markov chain {Y k , k = 1, . . .} in Section [A. 31 for the Ornstein-Uhlenbeck process with 

dX(t) = -aX(t)dt + adW(t), a, a > 

and 

p(x) = F(X(T° k+1 ) =x + 5\ X(T° k ) = x 



i:_ s e^^ 2 du 



jX+S eau 2 /a 2 du 



Then 

00 

(10) MW£)= £ ir(n5)w(n5) 



n=— 00 
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and we need to find 5 such that ¥, n (W®) = to/N, essentially by enlightened trial-and-error to 
obtain sufficient accuracy and precision. Here, by E n we indicate expectation conditioned on Y± 
having initial distribution tt on <5Z, so that {Y^, k = 0, 1, . . .} is stationary. For the particular 
goal of K n (W®) = 0.004, 5 was found for two combinations of parameters. For a = 8, a = 1, 
5 = 0.063015 was found to give an absolute relative error in E 7r (VF^ ) ) of less than 1.05 in 10,000, 
or about 1 point in 10,000. Figure I2U1 illustrates the empirical distributions of Yi, Y2, Y20, Yioo> 
and ilooo f° r these parameters, using 100,000 sample paths are virtually indistinguishable from 
the stationary distribution, justifying use of the stationary distribution. For a = 10, a = 1, 
5 = 0.062945 was found to give an absolute relative error in E^Wj?) of less than 0.2 in 10,000. 




Figure 29. Distribution of empirical passage times for the Ornstein-Uhlenbeck 
process for 5 = 0.063015, a = 8, a = 1 using 100,000 Markov chains. They 
appear indistinguishable from the stationary distribution. 



B.4. Feller's Square Root Diffusion. Suppose that dX(t) = k((1 - X(t))dt + <jy/X(t)dW(t) 
for K,fj,,a > 0, 2n/j,/a 2 > 1 and -X"(0) has the stationary Gamma(a, b) distribution with shape 
parameter a = 2nfi/a 2 and scale parameter b = a 1 /(2k). Although this is also a stationary 
diffusion, determination of 5 must proceed quite differently from the use of ()10p above. We can 
again think in terms of the Markov chain Yfc = X(TP), {Y^, k = 0,1,...}. We can obtain an 
integral formula for E(W° +1 | X(T$) = x) for x = kS, k > 2 but not for x = 5. It is possible 
to estimate this latter value by simulation to approximate w. Since we have all the transition 
probabilities for the Markov chain, calculation of the stationary distribution ir is straightforward 
and we can find 

00 

n=—oo 

The key problem is the Markov chain is not stationary, even though the underlying process is 
stationary, and E 7r (W ( ^ J ) isn't terribly helpful. A reason for this somewhat surprising result is 
easy to see when parameter values are used. For example, consider the case of 5 = 0.0220335, 
k = 6, and \i = 0.2. Since we know the stationary Gamma distribution we have Pr(Yo = 5) < 
Pr(X(0) < 5) = 0.01144. On the other hand, for the stationary distribution Pr(vr = S) = 0.0029 
to four digits. Then the following result applies. 

Lemma 8. Assume Yq does not have the stationary distribution. Then does not have the 
stationary distribution for all k > 0. 

Proof. We'll show that Y\ cannot have the stationary distribution, but the same argument 

easily extends to Yfc-i and Yp.. Assume Y\ = ir. Let P be the transition matrix for the Markov 
chain. Then {lfc/<5, k = 0, 1, . . .},with state space {1,2,...}, is the embedded Markov chain 
for some continuous-time process {V(t), t > 0} with exponential holding times, different from 
{X(t), t > 0}, which need not have exponential holding times. Since the transitions of the 
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Markov chain are always ±1, {V(t), t > 0} is a birth and death process. Mean reversion of 
{X(t), t > 0} implies k = 0,1,...} is positive recurrent, and so ergodic. This means 

{Yfc/<5, k = 0, 1, . . .}, and so {Y^, k = 0, 1, . . .}, is time reversible. Let Q be the transition matrix 
for the time reversed Markov chain. Since tt is the stationary distribution of the time reversed 
chain too, 

Y = QYi = Qn = 7r. 

But this violates the assumption that Yq does not have the stationary distribution. Hence, Y\ 
cannot have the stationary distribution. □ 

(Note that a possibly simpler argument would say Y\ = tt and so 7r = Y± = PYq while Pit = tt. 
Combining these two equations gives PYq = Pit and so Y\ = tt. However this requires that P 
be invertible, which is difficult to establish.) 

Figure l30l illustrates the empirical distributions of Y\, Y2, Y20, Yioo> and Y1000 using 100,000 
simulated Markov chains. The distributions are clearly different from each other, and from 
the stationary distribution, even for the 1000th passage time. This suggests calculations based 
on the stationary distribution of the crossing times are of limited use for Feller's square root 
process. 




5 10 15 20 25 30 35 40 45 50 



Figure 30. Distribution of empirical passage times for Feller's square root pro- 
cess for 5 = 0.0220335, [i = 0.2, k = 6, and a = 1 using 100,000 simulated 
Markov chains. The distributions appear quite different, indicating the Markov 
chain is not stationary. 

Since the Markov chain at passage times is not stationary it was decided to simulate ap- 
proximate paths of sufficient length using ([9]) and trial-and-error to determine 5 as needed. For 
example, with k = 6, a = 1, to put 1250 crossings in the interval (T$,Tq + 5) (dropping the 
first passage time), 5 = 0.027100 was obtained using a time step A = 10~ 5 and 5,000 sample 
paths. However, other values of A yielded different values for 5, which were significant consid- 
ering the precision obtained. In effect, the goal is to estimate 5 as A — > 0, although simulation 
time quickly becomes impractical as A gets smaller. Regression and extrapolation were used to 
improve the estimate. Let m = — log 10 (A). Then the simulations provides estimates 5 m using 
A = 10 _m . Since log 10 (e) m+ i — 5 m ) appears quite linear, linear least-squares regression was 
used to estimate a and b in the recursive form <5 m +i = S m + lo am+fe , m > 3 where 63 is known. 
Figure ED shows simulation results with 5 m for k = 6, a = 1 to obtain either 1250 crossings 
in an interval of length 5 or 5000 crossings in an interval of length 20. The results are quite 
similar. This is not surprising. As the number of crossings increases, the distributions of the 
crossings Yju get closer to the stationary distribution and the non-stationary behaviour of for 
k small is less dominant. The fitted line provides an improved estimate = 0.028163 for 1250 
crossings. 

50 



k=6, n=1250 
k=6, n=5000 



5 6 7 

-tog.JAJ 



Figure 31. Determination of S for the Feller square root process with n = 6, 
(7 = 1. The results for 1250 and 5000 crossings are similar. The fitted curve is 
for 1250 crossings, and provides an improved estimate for S. 



B.5. Other Processes. For other processes, determining S to achieve N crossings on average 
in the interval (0, to) can be accomplished with a trial and error approach. Simulate a large 
number of independent sample paths (say M) with a time resolution sufficiently fine to minimize 
interpolation errors. Choose a value S* as a possible value for S. For sample path i = 1, . . . , M 
determine the crossings and find the time Tjy i of the iV-th crossing time. If the sample mean 

Tpf = Yli=i T% i/M is sufficiently close to to (e-g-, to is within the confidence interval for Tjy) , 
then 5* can serve as the desired 5. If not, adjust the candidate value 5* larger or smaller and 
try again. This can be combined with other root-finding or regression techniques to find 5 with 
fewer iterations. 
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